
Items 1-83 of 83 One page. 

n 1 : .!>Mkoiio .C, L 111 A,.Sp^^ '---"^^^ 

Oral fibroblast expression of wound inducible transcript-3.0 (wit3.0)
accelerates the collagen gel contraction in vitro.

J Biol Chem. 2003 Oct 3 [Epub ahead of print]
PMID: 14527947 [PubMed - as supplied by publisher]



2: Eng C.

Constipation, polyps, or cancer? Let PTEN predict your future.

Am J Med Genet. 2003 Nov 1;122A(4):315-22.
PMID: 14518069 [PubMed - in process]



3: Juhan-Vague I, Morange PE, Alessi MC.

The insulin resistance syndrome: implications for thrombosis and
cardiovascular disease.

Pathophysiol Haemost Thromb. 2002 Sep-Dec;32(5-6):269-73.
PMID: 13679655 [PubMed - in process]

4: Marchand-Adam S, Marchal J, Cohen M, Soler P, Gerard B, Castier Y, Leseche G, Valeyre D, Mal H, Aubier M, Dehoux M, Crestani B.

Defect of hepatocyte growth factor secretion by fibroblasts in idiopathic
pulmonary fibrosis.

Am J Respir Crit Care Med. 2003 Aug 28 [Epub ahead of print]
PMID: 12947024 [PubMed - as supplied by publisher]



5: Davies JA.

Morphogenesis of the metanephric kidney.

ScientificWorldJournal. 2002 Jun 28;2(6):1937-50.
PMID: 12920322 [PubMed - in process]

6: Xu L, Alarcon C, Col S, Massague J.

Distinct domain utilization by Smad3 and Smad4 for nucleoporin interaction
and nuclear import.

J Biol Chem. 2003 Aug 13 [Epub ahead of print]
PMID: 12917407 [PubMed - as supplied by publisher]



7: Li G, Wang S, Gelehrter TD.

Identification of glucocorticoid receptor domains involved in transrepression
of TGFbeta action.

J Biol Chem. 2003 Aug 5 [Epub ahead of print]
PMID: 12902338 [PubMed - as supplied by publisher]

8: Zhou H, Lamont SJ.

Association of transforming growth factor beta genes with quantitative trait
loci for antibody response kinetics in hens.

Anim Genet. 2003 Aug;34(4):275-82.
PMID: 12873215 [PubMed - in process]



n 9; Mow A. Van Je 

Higher stromal expression of transforming growth factor-beta type II
receptors is associated with poorer prognosis breast tumors.

Breast Cancer Res Treat. 2003 May;79(2):149-59.
PMID: 12825850 [PubMed - in process]

10: Ryu B, Kern SE.

The essential similarity of TGFbeta and activin receptor transcriptional
responses in cancer cells.

Cancer Biol Ther. 2003 Mar-Apr;2(2):164-70.
PMID: 12750556 [PubMed - in process]

11: Shin HD, Park BL, Kim LH, Jung JH, Kim JY, Yoon JH, Kim YJ,

Interleukin 10 haplotype associated with increased risk of hepatocellular
carcinoma.

Hum Mol Genet. 2003 Apr 15;12(8):901-6.
PMID: 12668613 [PubMed - in process]



12: Molteni A, Ward WF, Ts'ao CH, Taylor J, Small W Jr, Brizio-Molteni L, Veno PA.

Cytostatic properties of some angiotensin I converting enzyme inhibitors
and of angiotensin II type I receptor antagonists.

Curr Pharm Des. 2003;9(9):751-61. Review.
PMID: 12570792 [PubMed - indexed for MEDLINE]

13: Hohenstein P, Molenaar L, Elsinga J, Morreau H, van der Klift H, Struijk A, Jagmohan-Changur S, Smits R, van Kranen H, van Ommen GJ, Cornelisse C, Devilee P, Fodde R.

Serrated adenomas and mixed polyposis caused by a splice acceptor
deletion in the mouse Smad4 gene.

Genes Chromosomes Cancer. 2003 Mar;36(3):273-82.
PMID: 12557227 [PubMed - indexed for MEDLINE]



14: Lloyd RV, Vidal S, Horvath E, Kovacs K, Scheithauer B.

Angiogenesis in normal and neoplastic pituitary tissues.

Microsc Res Tech. 2003 Feb 1;60(2):244-50. Review.
PMID: 12539179 [PubMed - indexed for MEDLINE]

15: Jordan GR, Loveridge N, Power J, Clarke MT, Parker M, Reeve J.

The ratio of osteocytic incorporation to bone matrix formation in femoral
neck cancellous bone: an enhanced osteoblast work rate in the vicinity of
hip osteoarthritis.

Calcif Tissue Int. 2003 Mar;72(3):190-6. Epub 2003 Jan 21.
PMID: 12532281 [PubMed - in process]

16: Piard F, Martin L, Chapusot C, Ponnelle T, Faivre J.

[Genetic pathways in colorectal cancer: interest for the pathologist]

Ann Pathol. 2002 Sep;22(4):277-88. Review. French.
PMID: 12410150 [PubMed - indexed for MEDLINE]



17: Bach JF.

[Current concepts of autoimmunity]

Rev Neurol (Paris). 2002 Oct;158(10 Pt 1):881-6. Review. French.
PMID: 12407294 [PubMed - indexed for MEDLINE]

1 8 • Barom .;L RelciUvJ .A!licl<?s, Links 

' StnheHini G. Baldacci i\ Locci P. 

Crouzon's syndrome: differential in vitro secretion of bFGF, TGFbeta 1
isoforms and extracellular matrix macromolecules in patients with FGFR2
gene mutation.

Cytokine. 2002 Jul 21;19(2):94-101.
PMID: 12182844 [PubMed - indexed for MEDLINE]



19: Lloyd RV, Ferreiro JA, Jin L, Sebo TJ.

TGFB, TGFB Receptors, Ki-67, and p27(Kip)1 Expression in Papillary
Thyroid Carcinomas.

Endocr Pathol. 1997 Winter;8(4):293-300.
PMID: 12114790 [PubMed - as supplied by publisher]



20: Qian X, Jin L, Lloyd RV.

Expression and Regulation of Transforming Growth Factor B1 in Cultured
Normal and Neoplastic Rat Pituitary Cells.

Endocr Pathol. 1996 Spring;7(1):77-90.
PMID: 12114683 [PubMed - as supplied by publisher]



21: Lloyd RV.

Growth Factors.

Endocr Pathol. 1997 Summer;8(2):121-127.
PMID: 12114632 [PubMed - as supplied by publisher]

22: Bauer G.

Signaling and proapoptotic functions of transformed cell-derived reactive
oxygen species.

Prostaglandins Leukot Essent Fatty Acids. 2002 Jan;66(1):41-56. Review.
PMID: 12051956 [PubMed - indexed for MEDLINE]

23: ^|i/.ui.^.ni.li..FkUife^ Sawada T, Fukumoto T.

Inflammatory and anti-inflammatory cytokines regulate the recovery from
sublethal X irradiation in rat thymus.

Radiat Res. 2002 Mar;157(3):281-9.
PMID: 11839090 [PubMed - indexed for MEDLINE]

24: Piccinno R, Puopolo M, Rigault De La Longrais IA, Fracchioli S, Massobrio M, Katsaros D.

[Growth factors in epithelial ovarian cancer]

Minerva Ginecol. 2002 Feb;54(1):33-52. Review. Italian.
PMID: 11828269 [PubMed - indexed for MEDLINE]

25: Saika S, Saika S, Liu CY, Azhar M, Sanford LP, Doetschman T, Gendron RL, Kao CW, Kao WW.

TGFbeta2 in corneal morphogenesis during mouse embryonic
development.

Dev Biol. 2001 Dec 15;240(2):419-32.
PMID: 11784073 [PubMed - indexed for MEDLINE]



26: Pateder DB, Sheu TJ, O'Keefe RJ, Puzas JE, Schwarz EM, Constine LS, Okunieff P, Rosier RN.

Role of pentoxifylline in preventing radiation damage to epiphyseal growth
plate chondrocytes.

Radiat Res. 2002 Jan;157(1):62-8.
PMID: 11754643 [PubMed - indexed for MEDLINE]

27: Urban G, Marinoni E, Di Iorio R, Lucchini C, Alo P, Di Tondo U.

New placental factors: Between implantation and inflammatory reaction.

Early Pregnancy. 2001 Jan;5(1):70-1.
PMID: 11753520 [PubMed - indexed for MEDLINE]
28: Bischof P.

Endocrine, paracrine and autocrine regulation of trophoblastic
metalloproteinases.

Early Pregnancy. 2001 Jan;5(1):30-1.
PMID: 11753501 [PubMed - indexed for MEDLINE]



1*129: LMaJ^^Oiciiiam CO 


TGFB-responsive human trophoblast-derived cell lines.

Placenta. 2001 Nov;22(10):889-90. No abstract available.
PMID: 11718579 [PubMed - indexed for MEDLINE]



30: Misiakos EP, Kouraklis G, Agapitos E, Perrea D, Karatzas G, Boudoulas H, Karayannakos PE.

Expression of PDGF-A, TGFb and VCAM-1 during the developmental
stages of experimental atherosclerosis.

Eur Surg Res. 2001 Jul-Aug;33(4):264-9.
PMID: 11684832 [PubMed - indexed for MEDLINE]



31: Bauer G.

Lactobacilli-mediated control of vaginal cancer through specific reactive
oxygen species interaction.

Med Hypotheses. 2001 Aug;57(2):252-7.
PMID: 11461184 [PubMed - indexed for MEDLINE]



32: Bauer G.

Simplicity through complexity: immunoblot with recombinant antigens as
the new gold standard in Epstein-Barr virus serology.

Clin Lab. 2001;47(5-6):223-30. Review.
PMID: 11405600 [PubMed - indexed for MEDLINE]

n 33* :!<?M^:>n..!L..AlhrcdU Related Arilcies. Links 

Johansson B, Hoglund M.

Altered expression of TGFB receptors and mitogenic effects of TGFB in
pancreatic carcinomas.

Int J Oncol. 2001 Jul;19(1):71-81.
PMID: 11408925 [PubMed - indexed for MEDLINE]



34: Pateder DB, Eliseev RA, O'Keefe RJ, Schwarz EM, Okunieff P.

The role of autocrine growth factors in radiation damage to the epiphyseal
growth plate.

Radiat Res. 2001 Jun;155(6):847-57.
PMID: 11352768 [PubMed - indexed for MEDLINE]



35: Mould TL, Roberts-Thomson PJ.

Pathogenesis of telangiectasia in scleroderma.

Asian Pac J Allergy Immunol. 2000 Dec;18(4):195-200.
PMID: 11316039 [PubMed - indexed for MEDLINE]

36: Strand V.

Monoclonal antibodies and other biologic therapies.

Lupus. 2001;10(3):216-21. Review.
PMID: 11315356 [PubMed - indexed for MEDLINE]



n37: Pros/ C 


Alloregulatory T cells producing TGFB/IL10 can be generated ex vivo.

J Heart Lung Transplant. 2001 Feb;20(2):229. No abstract available.
PMID: 11250438 [PubMed - as supplied by publisher]
38: ^Mdi.Ii.&>>>;i-MG,.Mm^^

Effect of familial hypertension on glomerular hemodynamics and
tubulo-glomerular feedback after uninephrectomy.

Am J Hypertens. 2001 Feb;14(2):121-8.
PMID: 11243302 [PubMed - indexed for MEDLINE]



39: Bauer G.

Reactive oxygen and nitrogen species: efficient, selective, and interactive
signals during intercellular induction of apoptosis.

Anticancer Res. 2000 Nov-Dec;20(6B):4115-39. Review.
PMID: 11205238 [PubMed - indexed for MEDLINE]



40: Bourhis J, Lartigau E.

[Biomodulation and radiotherapy]

Cancer Radiother. 2000 Nov;4 Suppl 1:128s-133s. Review. French.
PMID: 11194950 [PubMed - indexed for MEDLINE]



41: Robbins ME, O'Malley Y, Zhao W, Davis CS, Bonsib SM.

The role of the tubulointerstitium in radiation-induced renal fibrosis.

Radiat Res. 2001 Mar;155(3):481-9.
PMID: 11182800 [PubMed - indexed for MEDLINE]



42: Frye CA, Yocum DE, Tuan R, Suyana E, Seftor EA, Seftor RE, Khalkhali-Ellis Z, Moore TL, Hendrix MJ.

An in vitro Model for Studying Mechanisms Underlying Synoviocyte-Mediated
Cartilage Invasion in Rheumatoid Arthritis.

Pathol Oncol Res. 1996;2(3):157-166.
PMID: 11173599 [PubMed - as supplied by publisher]

43: Chakrabarty S, Liu BR, Rajagopal S.

Disruption of transforming growth factor beta-regulated laminin receptor
function by expression of antisense laminin, a chain RNA in human colon
cancer cells.

J Cell Physiol. 2001 Jan;186(1):47-52.
PMID: 11147813 [PubMed - indexed for MEDLINE]

44: Fallik D, Sabourin JC, Ducreux M.

[Response of metastatic colorectal cancers to treatment with CPT11
(irinotecan); implications of the mismatched base repair system]

Gastroenterol Clin Biol. 2000 Oct;24(10):917-22. French.
PMID: 11084429 [PubMed - indexed for MEDLINE]

45: Seftel AD, Maclennan GT, Chen ZJ, Liu, Ferguson K, Deoreo, Levine F, Hampel N, Ho-Chang C.

Loss of TGFbeta, Apoptosis, and Bcl-2 in Erectile Dysfunction and
Upregulation of p53 and HIF-1alpha in Diabetes-Associated Erectile
Dysfunction.

Mol Urol. 1999;3(2):103-107.
PMID: 10851311 [PubMed - as supplied by publisher]



C 46: Lee SiiJJJX^,:( an jJ[T 



Suppression of TGF-beta signaling in both normal conjunctival fibroblasts
and pterygial body fibroblasts by amniotic membrane.

Curr Eye Res. 2000 Apr;20(4):325-34.
PMID: 10806447 [PubMed - indexed for MEDLINE]



47: Zheng H, Wang J, Hauer-Jensen M.

Role of mast cells in early and delayed radiation injury in rat intestine.

Radiat Res. 2000 May;153(5 Pt 1):533-9.
PMID: 10790274 [PubMed - indexed for MEDLINE]

48: Cheng EY, Decker RS, Lee C.

Role of angiotensin II in bladder smooth muscle growth and function.

Adv Exp Med Biol. 1999;462:183-91. Review.
PMID: 10599423 [PubMed - indexed for MEDLINE]

n 49: 0;Miil{ey Y^ZIm). Relsi^(i A-tloles, L.!nks: 

Radiation-induced alterations in rat mesangial cell Tgfb1 and Tgfb3 gene
expression are not associated with altered secretion of active Tgfb
isoforms.

Radiat Res. 1999 Dec;152(6):622-8.
PMID: 10581532 [PubMed - indexed for MEDLINE]

50: Fogar P, Pasquali C, Basso D, Floreani A, Piva MG, De Paoli M, Melis A, Sperti C, Pedrazzoli S, Plebani M.

Transforming growth factor beta, fibrogenesis and hyperglycemia in
patients with chronic pancreatitis.

J Med. 1998;29(5-6):277-87.
PMID: 10503164 [PubMed - indexed for MEDLINE]

51: Geng MM, Ellenrieder V, Wallrapp C, Muller-Pillasch F, Sommer G,

Use of representational difference analysis to study the effect of TGFB on
the expression profile of a pancreatic cancer cell line.

Genes Chromosomes Cancer. 1999 Sep;26(1):70-9.
PMID: 10441008 [PubMed - indexed for MEDLINE]

52: Zhao W, O'Malley Y, Robbins ME.

Irradiation of rat mesangial cells alters the expression of gene products
associated with the development of renal fibrosis.

Radiat Res. 1999 Aug;152(2):160-9.
PMID: 10409325 [PubMed - indexed for MEDLINE]



(j 53. Hashinioic^S,. SM^^^^ Related Article*. Links 

Matushita K, Kobayashi M, Miyasaka K.

The suppression of metastases and the change in host immune response
after low-dose total-body irradiation in tumor-bearing rats.

Radiat Res. 1999 Jun;151(6):717-24.
PMID: 10360792 [PubMed - indexed for MEDLINE]



54: Jonson T, Gorunova L, Dawiskiba S, Andren-Sandberg A,

' .SlmmmiL.ten.l)jike.P,.J^^ 

Molecular analyses of the 15q and 18q SMAD genes in pancreatic cancer.

Genes Chromosomes Cancer. 1999 Jan;24(1):62-71.
PMID: 9892110 [PubMed - indexed for MEDLINE]

55: Mu Z, Rohrer GA, Stone RT, Murtaugh MP, Beattie CW.

Genomic mapping of chemokine and transforming growth factor genes in
swine.

Mamm Genome. 1997 Apr;8(4):246-9.
PMID: 9096103 [PubMed - indexed for MEDLINE]

56: Mey J, Thanos S.

Functional and biochemical analysis of CNS-relevant neurotrophic activity
in the lesioned sciatic nerve of adult rats.

J Hirnforsch. 1996;37(1):25-50.
PMID: 8964975 [PubMed - indexed for MEDLINE]

57: Ace CI, Okulicz WC.

Differential gene regulation by estrogen and progesterone in the primate
endometrium.

Mol Cell Endocrinol. 1995 Nov 30;115(1):95-103.
PMID: 8674869 [PubMed - indexed for MEDLINE]

58: Spanakis E, Brouty-Boye D.

Quantitative variation of proto-oncogene and cytokine gene expression in
isolated breast fibroblasts.

Int J Cancer. 1995 May 29;61(5):698-705. Erratum in: Int J Cancer 1995 Nov 15;63(4):609.
PMID: 7768644 [PubMed - indexed for MEDLINE]

59: Bhattacharyya N, Ramsammy R, Eatman E, Hollis VW, Anderson WA.

Protooncogene, growth factor, growth factor receptor, and estrogen and
progesterone receptor gene expression in the immature rat uterus after
treatment with estrogen and tamoxifen.

J Submicrosc Cytol Pathol. 1994 Apr;26(2):147-62.
PMID: 8019941 [PubMed - indexed for MEDLINE]

60: Williams AC, Hague A, Manning AM, Van der Stappen JW, Paraskeva C.

In vitro models of human colorectal cancer.

Cancer Surv. 1993;16:15-29. Review.
PMID: 8348534 [PubMed - indexed for MEDLINE]

61: Mathews LS, Vale WW.

Molecular and functional characterization of activin receptors.

Receptor. 1993 Fall;3(3):173-81. Review.
PMID: 8167568 [PubMed - indexed for MEDLINE]

62: Allannic H, Guilhem I, Maugendre D.

[Is thyroid hormone useful in the prevention of nodular recurrence after
hemithyroidectomy?]

Ann Endocrinol (Paris). 1993;54(4):286-90. Review. French.
PMID: 8092805 [PubMed - indexed for MEDLINE]



63: Stampfer MR, Yaswen P.

Culture systems for study of human mammary epithelial cell proliferation,
differentiation and transformation.

Cancer Surv. 1993;18:7-34. Review.
PMID: 8013001 [PubMed - indexed for MEDLINE]

64: Hendricks-Taylor LR, Bachinski LL, Siciliano MJ, Fertitta A, Trask B, de Jong PJ, Ledbetter DH, Darlington GJ.

The CCAAT/enhancer binding protein (C/EBP alpha) gene (CEBPA) maps
to human chromosome 19q13.1 and the related nuclear factor NF-IL6
(C/EBP beta) gene (CEBPB) maps to human chromosome 20q13.1.

Genomics. 1992 Sep;14(1):12-7.
PMID: 1427819 [PubMed - indexed for MEDLINE]

n 65: L<x.SW,.Tgmasca^^ Related Articles. Links 

Transcriptional downregulation of gap-junction proteins blocks junctional
communication in human mammary tumor cell lines.

J Cell Biol. 1992 Sep;118(5):1213-21.
PMID: 1324944 [PubMed - indexed for MEDLINE]
66: Miller MA, Bokhari M, Qci&

Simultaneous assessment of TGFB and cell cycle kinetics using
IUdR/BrdU infusions in human neoplasms from plastic-embedded tissue.

J Histochem Cytochem. 1992 Mar;40(3):427-30.
PMID: 1313063 [PubMed - indexed for MEDLINE]



n 67: PhiUips.GO^.White^^ 



Inhibition by methylprednisolone acetate suggests an indirect mechanism
for TGF-B induced angiogenesis.

Growth Factors. 1992;6(1):77-84.
PMID: 1375479 [PubMed - indexed for MEDLINE]



68: Coombes RC, Barrett-Lee P, Luqmani Y.

Growth factor expression in breast tissue.

J Steroid Biochem Mol Biol. 1990 Dec 20;37(6):833-6.
PMID: 2285595 [PubMed - indexed for MEDLINE]



69: Streilein JW, Grammer SF, Yoshikawa T, Demidem A, Vermeer M.

Functional dichotomy between Langerhans cells that present antigen to
naive and to memory/effector T lymphocytes.

Immunol Rev. 1990 Oct;117:159-83. Review.
PMID: 2258190 [PubMed - indexed for MEDLINE]



G 70: Strain AIHiiLDl 



Changes in sensitivity of hepatocytes isolated from regenerating rat liver to
25     936.2   78.2   1569    6  106221      106221 Sequence 3
26     927.8   77.5   1561    6  108275      108275 Sequence 3
27     891     74.4   1597   10  AF191297    AF191297 Cavia por
28     854.2   71.4   1585   10  RNTGFB1     X52498 Rat mRNA fo
29     847.8   70.8   1579   10  MUSTGFRNA   M13177 Mouse trans
30     847.8   70.8   1641   10  BC013738    BC013738 Mus muscu
31     847.8   70.8   2094   10  MMU009862   AJ009862 Mus muscu
32     805.6   67.3   1117    4  BOVTGFB     M36271 Bovine tran
33     804.6   67.2   1125   10  AF480858    AF480858 Sigmodon
34-37  [entries illegible in the source scan]
38     586.6   49.0   1350    6  AX528615    AX528615 Sequence
39     570.6   47.7    699    6  105434      105434 Sequence 4
40     344     28.7    489    6  AX455100    AX455100 Sequence
41     340.4   28.4    469   10  MATGFB1     X60296 M.auratus m
42     329.2   27.5   1256    5  CHKTGFB4    M31160 Gallus gall
43     310.2   25.9    339    6  AR036686    AR036686 Sequence
44     309.6   25.9    374    4  AF349538    AF349538 Canis fam
45     308.6   25.8    339    4  OCAF000133  AF000133 Oryctolag



ALIGNMENTS 



RESULT 1 
AX338213
LOCUS       AX338213     1326 bp    DNA     linear   PAT 09-JAN-2002
DEFINITION  Sequence 1 from Patent WO0181404.
ACCESSION   AX338213
VERSION     AX338213.1  GI:18128750
KEYWORDS
SOURCE      Sus scrofa (pig)
  ORGANISM  Sus scrofa
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE   1
  AUTHORS   Strober,W., Nakamura,K., Kitani,A. and Fuss,I.J.
  TITLE     Inducible plasmid vector encoding tgf-_g(b) and uses thereof
  JOURNAL   Patent: WO 0181404-A 1 01-NOV-2001;
            THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US)
FEATURES             Location/Qualifiers
     source          1..1326
                     /organism="Sus scrofa"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9823"
     CDS             16..1188
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD20538.1"
                     /db_xref="GI:18128751"
                     /translation="MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
                     YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR
                     LSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
BASE COUNT      263 a    438 c    392 g    233 t
ORIGIN

Query Match 94.9%; Score 1135.8; DB 6; Length 1326; 

Best Local Similarity 97.8%; Pred. No. 6.3e-197; 

Matches 1171; Conservative 0; Mismatches 2; Indels 24; Gaps 1; 
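
The summary figures above appear to be related by simple arithmetic. The sketch below assumes (it is not stated in the report) that "Best Local Similarity" is the number of matches divided by the total number of aligned columns, that is, matches plus conservative substitutions plus mismatches plus indels. The function name is hypothetical. Under that assumption the counts reported for this result reproduce the 97.8% shown.

```python
def best_local_similarity(matches, mismatches, indels, conservative=0):
    """Percent of aligned columns that are exact matches.

    Assumption: the denominator is every aligned column, i.e. matches,
    conservative substitutions, mismatches and indel positions together.
    """
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts taken from the summary lines of this result.
print(round(best_local_similarity(matches=1171, mismatches=2, indels=24), 1))
```

The same formula applied to 1168 matches, 5 mismatches and 24 indels gives a value that rounds to 97.6.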



Qy    1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   16 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 75

Qy   61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   76 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 135

Qy  121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db  136 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 195

Qy  181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db  196 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 255

Qy  241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  256 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 315

Qy  301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  316 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 375

Qy  361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  376 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 435

Qy  421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  436 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 495

Qy  481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  496 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 555

Qy  541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  556 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 615

Qy  601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  616 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 675

Qy  661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  676 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 735

Qy  721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  736 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 795

Qy  781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  796 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA------ 849

Qy  841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                          ||||||||||||||||||||||||||||||||||||||||||
Db  850 ------------------GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 891

Qy  901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  892 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 951

Qy  961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  952 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1011

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1012 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1071

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1072 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1131

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1132 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1188



RESULT 2 

AF461808
LOCUS       AF461808     2221 bp    mRNA    linear   MAM 03-JAN-2002
DEFINITION  Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete
            cds.
ACCESSION   AF461808
VERSION     AF461808.1  GI:18042250
KEYWORDS
SOURCE      Sus scrofa (pig)
  ORGANISM  Sus scrofa
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE   1  (bases 1 to 2221)
  AUTHORS   Wimmers,K., Chomdej,S., Ponsuksili,S. and Schellander,K.
  TITLE     Polymorphism in the porcine transforming growth factor beta 1 gene
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2221)
  AUTHORS   Wimmers,K., Chomdej,S., Ponsuksili,S. and Schellander,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2001) Institute of Animal Breeding Science,
            University of Bonn, Endenicher Allee 15, Bonn 53115, Germany
FEATURES             Location/Qualifiers
     source          1..2221
                     /organism="Sus scrofa"
                     /mol_type="mRNA"
                     /db_xref="taxon:9823"
                     /chromosome="6"
     gene            1..2221
                     /gene="TGFB1"
     CDS             1..1173
                     /gene="TGFB1"
                     /note="cytokine"
                     /codon_start=1
                     /product="transforming growth factor beta 1"
                     /protein_id="AAL57902.1"
                     /db_xref="GI:18042251"
                     /translation="MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
                     YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR
                     LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     3'UTR           1174..2221
                     /gene="TGFB1"
BASE COUNT      504 a    616 c    649 g    452 t
ORIGIN



Query Match 94.5%; Score 1131; DB 4; Length 2221; 

Best Local Similarity 97.6%; Pred. No. 4.4e-196; 

Matches 1168; Conservative 0; Mismatches 5; Indels 24; Gaps 1; 



Qy    1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
        ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60

Qy   61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

Qy  121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db  121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 180

Qy  181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db  181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 240

Qy  241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

Qy  301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360

Qy  361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Qy  421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

Qy  481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

Qy  541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600



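Qy  601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660

Qy  661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
        ||||||| ||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db  661 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

Qy  721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

Qy  781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA------ 834

Qy  841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                          ||||||||||||||||||||||||||||||||||||||||||
Db  835 ------------------GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 876

Qy  901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  877 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 936

Qy  961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  937 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 996

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  997 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1056

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1057 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1116

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1117 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1173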


RESULT 3 
PIGTGFB1A
LOCUS       PIGTGFB1A    3206 bp    mRNA    linear   MAM 31-MAR-1995
DEFINITION  Sus scrofa transforming growth factor beta-1 mRNA, complete cds.
ACCESSION   M23703
VERSION     M23703.1  GI:755044
KEYWORDS    transforming growth factor-beta-1.
SOURCE      Sus scrofa (pig)
  ORGANISM  Sus scrofa
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE   1  (bases 1 to 3206)
  AUTHORS   Kondaiah,P., Van Obberghen-Schilling,E., Ludwig,R.L., Dhar,R.,
            Sporn,M.B. and Roberts,A.B.
  TITLE     cDNA cloning of porcine transforming growth factor-beta 1 mRNAs.
            Evidence for alternate splicing and polyadenylation
  JOURNAL   J. Biol. Chem. 263 (34), 18313-18317 (1988)
  MEDLINE   89054010
  PUBMED    2461367
COMMENT     On Apr 1, 1995 this sequence version replaced gi:341017.
            Original source text: Sus scrofa (strain miniature swine) cDNA to
            mRNA.
FEATURES             Location/Qualifiers
     source          1..3206
                     /organism="Sus scrofa"
                     /mol_type="mRNA"
                     /strain="miniature swine"
                     /db_xref="taxon:9823"
                     /cell_type="peripheral blood lymphocyte"
     gene            1..3206
                     /gene="TGF-beta-1"
     CDS             906..2078
                     /gene="TGF-beta-1"
                     /codon_start=1
                     /product="transforming growth factor-beta-1"
                     /protein_id="AAA64616.1"
                     /db_xref="GI:755045"
                     /translation="MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
                     YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR
                     LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     polyA_signal    3186..3191
                     /gene="TGF-beta-1"
     polyA_site      3206
                     /gene="TGF-beta-1"
BASE COUNT      645 a   1041 c    924 g    596 t
ORIGIN
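
The numbers in this record can be checked against one another. The sketch below is an illustrative consistency check, not part of the search report. It confirms that the BASE COUNT entries sum to the LOCUS length, that the CDS span 906..2078 encodes 390 residues plus a stop codon, and that the first 60 bases of the CDS (the Db row at positions 906 to 965 in the alignment) translate to the start of the /translation qualifier. The function names are hypothetical, and the codon table is deliberately partial: it covers only the codons in that 60-base excerpt.

```python
# Illustrative consistency check for the M23703 record; helper names are hypothetical.

# Partial standard-code table: only the codons that occur in the excerpt below.
CODONS = {
    "ATG": "M", "CCG": "P", "CCT": "P", "TCG": "S", "GGG": "G",
    "CTG": "L", "CGG": "R", "CTC": "L", "TTG": "L", "TGG": "W",
    "CTA": "L", "GTG": "V",
}


def cds_residues(start: int, end: int) -> int:
    """Amino acids encoded by an inclusive CDS span that ends in a stop codon."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3 - 1  # the final codon is the stop


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon using the partial table above."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


# BASE COUNT values and the first 60 CDS bases, copied from the report.
base_count = {"a": 645, "c": 1041, "g": 924, "t": 596}
cds_start = "ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG"

print(sum(base_count.values()))   # compare with the LOCUS length
print(cds_residues(906, 2078))    # compare with the length of /translation
print(translate(cds_start))       # compare with the start of /translation
```

The same three checks apply to the other records in this report once their own spans and counts are substituted.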



Query Match 94.5%; Score 1131; DB 4; Length 3206;
Best Local Similarity 97.6%; Pred. No. 4.3e-196;
Matches 1168; Conservative 0; Mismatches 5; Indels 24; Gaps 1;
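
The percentages in these summaries appear to follow directly from the reported counts. The report does not state the definitions, so the sketch below rests on two assumptions inferred from the numbers: Best Local Similarity is the match count divided by the number of aligned columns (matches + mismatches + indels), and Query Match is the score divided by the query length of 1197 bases, the last Qy coordinate. The class and field names are hypothetical.

```python
from dataclasses import dataclass

QUERY_LENGTH = 1197  # assumption: taken from the final Qy coordinate in the alignments


@dataclass
class HitSummary:
    """Counts copied from one hit's summary lines in the report."""
    label: str
    score: float
    matches: int
    mismatches: int
    indels: int

    def best_local_similarity(self) -> float:
        # Assumed definition: identical columns over all aligned columns.
        aligned = self.matches + self.mismatches + self.indels
        return 100.0 * self.matches / aligned

    def query_match(self, query_length: int = QUERY_LENGTH) -> float:
        # Assumed definition: score relative to a perfect self-match of the query.
        return 100.0 * self.score / query_length


hits = [
    HitSummary("RESULT 3", 1131.0, 1168, 5, 24),
    HitSummary("RESULT 4", 1129.4, 1167, 6, 24),
    HitSummary("RESULT 5", 1111.6, 1164, 9, 27),
    HitSummary("RESULT 6", 999.8, 1086, 87, 24),
    HitSummary("RESULT 7", 967.8, 1066, 107, 24),
]

for hit in hits:
    print(hit.label,
          round(hit.query_match(), 1),
          round(hit.best_local_similarity(), 1))
```

Under these assumptions the recomputed values agree with the printed percentages for RESULT 3 through RESULT 7, which is a quick way to spot a digit misread in a scanned summary line.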



Qy      1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
          ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    906 ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 965

Qy     61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    966 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 1025

Qy    121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db   1026 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 1085

Qy    181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
          ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db   1086 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 1145



Qy    241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1146 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 1205

Qy    301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1206 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 1265

Qy    361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1266 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 1325

Qy    421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1326 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 1385

Qy    481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1386 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 1445

Qy    541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1446 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 1505

Qy    601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1506 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 1565

Qy    661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
          ||||||| ||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   1566 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 1625

Qy    721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1626 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 1685

Qy    781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1686 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA------ 1739

Qy    841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                            ||||||||||||||||||||||||||||||||||||||||||
Db   1740 ------------------GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1781

Qy    901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1782 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 1841

Qy    961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1842 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1901

Qy   1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1902 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1961

Qy   1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1962 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 2021

Qy   1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2022 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 2078
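
Each alignment block pairs a query (Qy) row with a database (Db) row and marks identical columns with a bar on the row between them. The sketch below shows one way such a marker row and the per-block counts could be derived from two equal-length aligned strings. It is a minimal illustration that assumes `-` is the gap character. The function names are hypothetical, and it does not reproduce the scoring scheme of the search program that generated this report. The example rows are the Qy 181 to 240 block and its Db counterpart from the alignment above.

```python
def match_line(qy: str, db: str, gap: str = "-") -> str:
    """Return a marker row: '|' where both rows carry the same base, ' ' otherwise."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    return "".join("|" if a == b and a != gap else " " for a, b in zip(qy, db))


def tally(qy: str, db: str, gap: str = "-") -> tuple[int, int, int]:
    """Count (matches, mismatches, indel columns) for one aligned block."""
    matches = mismatches = indels = 0
    for a, b in zip(qy, db):
        if a == gap or b == gap:
            indels += 1
        elif a == b:
            matches += 1
        else:
            mismatches += 1
    return matches, mismatches, indels


qy_181 = "AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT"
db_181 = "AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT"

print(qy_181)
print(match_line(qy_181, db_181))
print(db_181)
print(tally(qy_181, db_181))
```

Summing the tallies over every block of a hit should reproduce the Matches, Mismatches and Indels figures in that hit's summary, which helps when the marker row of a scanned alignment is illegible.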



RESULT 4 

SSTGFBR
LOCUS       SSTGFBR      1605 bp    mRNA    linear   MAM 27-MAR-1995
DEFINITION  Porcine mRNA for transforming growth factor-beta (TFG) precursor.
ACCESSION   Y00111
VERSION     Y00111.1  GI:2129
KEYWORDS    transforming growth factor-beta.
SOURCE      Sus scrofa (pig)
  ORGANISM  Sus scrofa
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE   1  (bases 1 to 1605)
  AUTHORS   Derynck,R. and Rhee,L.
  TITLE     Sequence of the porcine transforming growth factor-beta precursor
  JOURNAL   Nucleic Acids Res. 15 (7), 3187 (1987)
  MEDLINE   87174844
  PUBMED    3470708
FEATURES             Location/Qualifiers
     source          1..1605
                     /organism="Sus scrofa"
                     /mol_type="mRNA"
                     /db_xref="taxon:9823"
                     /tissue_type="ovary"
                     /clone_lib="lambda gt10"
     CDS             404..1576
                     /note="TFG-beta precursor (AA 1-390)"
                     /codon_start=1
                     /protein_id="CAA68291.1"
                     /db_xref="GI:2130"
                     /db_xref="SWISS-PROT:P07200"
                     /translation="MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
                     YYAKEVTRVLMLESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR
                     LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     1238..1573
                     /product="mature TFG-beta"
     misc_feature    647..655
                     /note="pot. N-glycosylation site"
     misc_feature    809..817
                     /note="pot. N-glycosylation site"
BASE COUNT      298 a    572 c    437 g    298 t
ORIGIN

Query Match 94.4%; Score 1129.4; DB 4; Length 1605;
Best Local Similarity 97.5%; Pred. No. 9e-196;
Matches 1167; Conservative 0; Mismatches 6; Indels 24; Gaps 1;



Qy      1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
          ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    404 ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 463

Qy     61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    464 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 523

Qy    121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db    524 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 583

Qy    181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
          ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db    584 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 643

Qy    241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    644 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 703

Qy    301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
          ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db    704 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGCTGGAAAGCGGCAACCAAATC 763

Qy    361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    764 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 823

Qy    421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    824 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 883

Qy    481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    884 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 943

Qy    541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    944 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 1003

Qy    601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1004 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 1063

Qy    661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
          ||||||| ||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   1064 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 1123

Qy    721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1124 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 1183

Qy    781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1184 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA------ 1237

Qy    841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                            ||||||||||||||||||||||||||||||||||||||||||
Db   1238 ------------------GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1279

Qy    901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1280 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 1339

Qy    961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1340 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1399

Qy   1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1400 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1459

Qy   1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1460 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1519

Qy   1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1520 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1576



RESULT 5 
GGTGFB1
LOCUS       GGTGFB1      1750 bp    mRNA    linear   MAM 27-MAR-1996
DEFINITION  Porcine mRNA for transforming growth factor-beta 1.
ACCESSION   X12373
VERSION     X12373.1  GI:63808
KEYWORDS    transforming growth factor-beta 1.
SOURCE      Sus scrofa (pig)
  ORGANISM  Sus scrofa
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE   1  (bases 1 to 1750)
  AUTHORS   Jakowlew,S.B., Dillard,P.J., Sporn,M.B. and Roberts,A.B.
  TITLE     Nucleotide sequence of chicken transforming growth factor-beta 1
            (TGF-beta 1)
  JOURNAL   Nucleic Acids Res. 16 (17), 8730 (1988)
  MEDLINE   88335639
  PUBMED    3166520
REFERENCE   2  (bases 1 to 1750)
  AUTHORS   Jakowlew,S.B.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-JUL-1988) Jakowlew S.B., National Institute of
            health, National Cancer Institute, Laboratory of Chemoprevention,
            Building 41, Room B902, Bethesda, Maryland 20892, USA
COMMENT     The submitters believe that the chicken cDNA library was
            contaminated with porcine cDNA, and that the sequence is in fact
            porcine TGF-beta-1. 27-MAR-1996.
FEATURES             Location/Qualifiers
     source          1..1750
                     /organism="Sus scrofa"
                     /mol_type="mRNA"
                     /strain="white leghorn"
                     /db_xref="taxon:9823"
                     /clone="pTGFB-ChX119"
                     /cell_type="chondrocyte"
     gene            1..1750
                     /gene="TGF-beta 1"
     5'UTR           1..446
                     /gene="TGF-beta 1"
     CDS             447..1622
                     /gene="TGF-beta 1"
                     /codon_start=1
                     /product="transforming growth factor"
                     /protein_id="CAA30933.1"
                     /db_xref="GI:63809"
                     /db_xref="SWISS-PROT:P07200"
                     /translation="MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
                     YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR
                     LSAHCSCDSKDNTLHVEINAGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSR
                     HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD
                     TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     1285..1619
                     /gene="TGF-beta 1"
     3'UTR           1620..1750
                     /gene="TGF-beta 1"
BASE COUNT      325 a    627 c    479 g    319 t

ORIGIN 



Query Match 92.9%; Score 1111.6; DB 4; Length 1750; 

Best Local Similarity 97.0%; Pred. No. 1.6e-192; 

Matches 1164; Conservative 0; Mismatches 9; Indels 27; Gaps 2; 



Qy      1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
          ||| ||||||||||||   |||||||||||||||||||||||||||||||||||||||||
Db    447 ATGCCGCCTTCGGGGCCTGGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 506

Qy     61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    507 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 566

Qy    121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db    567 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 626

Qy    181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
          ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db    627 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 686

Qy    241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    687 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 746

Qy    301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    747 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 806

Qy    361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    807 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 866

Qy    421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    867 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 926

Qy    481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db    927 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGGGC 986

Qy    541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    987 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 1046

Qy    601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1047 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 1106

Qy    661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAAC---GGGTTC 717
          ||||||| ||||| |||||||||||||||||||||||||||||||||||||   ||||||
Db   1107 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGCAGGGTTC 1166

Qy    718 AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 777
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1167 AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 1226

Qy    778 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGAC 837
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1227 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA--- 1283

Qy    838 TACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAG 897
                               |||||||||||||||||||||||||||||||||||||||
Db   1284 ---------------------GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAG 1322

Qy    898 AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 957
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1323 AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 1382

Qy    958 CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1017
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1383 CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1442

Qy   1018 CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1077
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1443 CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1502

Qy   1078 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1503 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1562

Qy   1138 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1563 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1622



RESULT 6
OATGFB1
LOCUS       OATGFB1      1173 bp    mRNA    linear   MAM 18-APR-1995
DEFINITION  O.aries mRNA for transforming growth factor-beta 1.
ACCESSION   X76916
VERSION     X76916.1  GI:496648
KEYWORDS    TGF-beta 1; transforming growth factor-beta 1.
SOURCE      Ovis aries (sheep)
  ORGANISM  Ovis aries
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Caprinae; Ovis.
REFERENCE   1
  AUTHORS   Woodall,C.J., McLaren,L.J. and Watt,N.J.
  TITLE     Sequence and chromosomal localisation of the gene encoding ovine
            latent transforming growth factor-beta 1
  JOURNAL   Gene 150 (2), 371-373 (1994)
  MEDLINE   95121932
  PUBMED    7821809
REFERENCE   2  (bases 1 to 1173)
  AUTHORS   Woodall,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-DEC-1993) C. Woodall, Univ. of Edinburgh, Dept. of
            Veterinary Pathology, Sc. of Vet. Studies, Univ. of Edinburgh,
            Edinburgh EH9 1QH, UK
FEATURES             Location/Qualifiers
     source          1..1173
                     /organism="Ovis aries"
                     /mol_type="mRNA"
                     /db_xref="taxon:9940"
     CDS             1..1173
                     /codon_start=1
                     /product="transforming growth factor-beta 1"
                     /protein_id="CAA54242.1"
                     /db_xref="GI:496649"
                     /db_xref="SWISS-PROT:P50414"
                     /translation="MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTIDMELVKRK
                     GIEAIRGQILSKLRLASPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEAD
                     YYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFFNTSELREAVPEPVLLSRADVRLLRL
                     KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTHREEIEGFR
                     LSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     835..1170
                     /product="transforming growth factor-beta 1"
BASE COUNT      245 a    378 c    336 g    214 t
ORIGIN

Query Match 83.5%; Score 999.8; DB 4; Length 1173;
Best Local Similarity 90.7%; Pred. No. 3.7e-172;
Matches 1086; Conservative 0; Mismatches 87; Indels 24; Gaps 1;



Qy      1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
          ||| |||||||||||||||||||  |||||||||||||||||||||||||||| ||| ||
Db      1 ATGCCGCCTTCGGGGCTGCGGCTGCTGCCGCTGCTGCTGCCGCTGCTGTGGCTACTAATG 60

Qy     61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
          ||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Db     61 CTGACGCCTGGCCGGCCGGTCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

IIIIIIIIMII {llllllllllll Mill llllll lllllll IIIIIIII III 
Db 121 GTGAAGCGGAAGGGCATCGAGGCCATCCGCGGTCAGATTTTGTCCAAACTTCGGCTCGCC 180 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

II IIIIIIIIIIIIIIIIIIIIIM IIMIIIIIIIIII ill!! lllllll III 

Db 181 AGTCCCCCGAGCCAGGGGGACGTGCCACCCGGCCCGCTGCCCGAGGCCATACTGGCCCTT 240 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

lllllllllllllllllllllll llllllllllll! Mil MIIMI IIIMMM 

Db 241 TACAACAGTACCCGCGACCGGGTGGCCGGGGAAAGTGCCGAAACGGAGCCTGAGCCAGAG 300 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

MMIIIIMMIIIIIIIIIIMIIIIIIIMIIIIIMIMII MIIMI IIIM 

Db 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAATACGGCAACAAAATC 3 60 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

MM! Ml III! Ill IMMI MIIIMI I IIMMIMM Mill! 

Db 361 TATGACAAAATGAAGTCTAGCTCGCACAGCATATATATGTTCTTCAACACGTCCGAGCTC 42 0 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 48 0 

Mllllll MM! IIMIIII IIMIIIIIIIIIIMI IIIMMIIIIIIMll 

Db 421 CGGGAAGCAGTGCCTGAACCTGTGTTGCTCTCTCGGGCAGACGTGCGCCTGCTGAGGCTC 48 0 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

IIIIIIIMIIIIIIIIIIMIMMIII IIIMIIIMIIIIII IIIIIIIIIM 

Db 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATATAGCAACAATTCCTGGCGC 54 0 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

MMMIIIMIMMIMI IIMMM II M M M M M M M II M M M M III 

Db 541 TACCTCAGCAACCGGCTGCTCGCCCCCAGCGACTCACCGGAGTGGCTGTCCTTTGACGTC 600 

Qy 6 01 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

M IIIIIIIIIIMIIIIIMIIIIII IIMIIII Mill M IIIMIIIIIII 

Db 601 ACTGGAGTTGTGCGGCAGTGGCTGACCCACAGAGAGGAAATAGAAGGCTTTCGCCTCAGT 660 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

lllllll MM! IMMI II IIMMM II II Mill M IIIIIIMII I 

Db 661 GCCCACTGTTCCTGTGACAGTAAGGATAACACGCTTCAAGTGGACATCAACGGGTTCAGT 72 0 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

M IMMMIIIIMMM II II M M II I M M M II M M M II II II M M M M 

Db 721 TCCGGCCGCCGGGGTGACCTCGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 84 0 

IIIIIIIIIII IIMMMIIIIIIIMIMMIIIMIIIII IMIIIIII 

Db 781 ATGGCCACCCCTCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGCCACCGCCGA 834 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIIIIIII lllllllllllllllllllllll II llllll 
Db 83 5 GCCCTGGACACCAACTACTGCTTCAGCTCCACAGAAAAGAAC 876 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

Mill M II lillMIIMIIIIIIMIIIMIMIIIIIIIIIIIMIIIIIIM 

Db 877 TGCTGTGTTCGTCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAC 936 



Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

IIIIIIMIIIIIIIII IIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIII 
Db 937 GAACCCAAGGGCTACCACGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTG 996 



Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCG^ 1080 

Mill IIIIIIIIIMIIIIIIIII IIIIMIIIMMIIIIIIIIIIIII IMIII 

Db 997 GACACACAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCATCGGCG 1056 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

lllilMIIMIIIMI lllllllllll II IIIIIIIIIIIIIIIIIIIMIIIIII 

Db 1057 GCGCCGTGCTGCGTGCCTCAGGCGCTGGAACCCCTGCCCATCGTGTACTACGTGGGCCGC 1116 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIIIIIMIII lllllllllllllllllll IIIIIIIMIIIIIIIII 

Db 1117 AAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1173 



RESULT 7 
DOGTGFB1A
LOCUS       DOGTGFB1A    1369 bp    mRNA    linear   MAM 30-OCT-1994
DEFINITION  Canine transforming growth factor-beta 1 (TGFB1) mRNA, complete
            cds.
ACCESSION   L34956
VERSION     L34956.1  GI:516071
KEYWORDS    homologue; transforming growth factor-beta 1.
SOURCE      Canis familiaris (dog)
  ORGANISM  Canis familiaris
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
REFERENCE   1  (bases 1 to 1369)
  AUTHORS   Manning,A.M., Auchampach,J.A., Drong,R.F. and Slightom,J.L.
  TITLE     Cloning of a canine cDNA homologous to human transforming growth
            factor-beta 1 (TGFbeta1)
  JOURNAL   Unpublished (1994)
COMMENT     Original source text: Canis familiaris adult jugular vein
            endothelial cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..1369
                     /organism="Canis familiaris"
                     /mol_type="mRNA"
                     /db_xref="taxon:9615"
                     /cell_type="LPS-activated"
                     /tissue_type="jugular vein endothelial"
                     /dev_stage="adult"
     gene            1..1369
                     /gene="TGFB1"
     5'UTR           1..57
                     /gene="TGFB1"
     CDS             58..1230
                     /gene="TGFB1"
                     /function="anti-inflammatory agent"
                     /note="precursor"
                     /codon_start=1
                     /product="transforming growth factor-beta 1"
                     /protein_id="AAA51458.1"
                     /db_xref="GI:516072"
                     /translation="MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLSSPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEAD
                     YYAKEVTRVLMVENTNKIYEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGVVRQWLSHGGEVEGFR
                     LSAHCSCDSKDNTLQVDINGFSSSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQ
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     1039..1227
                     /gene="TGFB1"
                     /product="transforming growth factor-beta 1"
     3'UTR           1231..1369
                     /gene="TGFB1"
BASE COUNT      264 a    473 c    415 g    216 t      1 Others
ORIGIN

Query Match 80.9%; Score 967.8; DB 4; Length 1369;
Best Local Similarity 89.1%; Pred. No. 2.4e-166;
Matches 1066; Conservative 0; Mismatches 107; Indels 24; Gaps 1;



Qy 


1 


Db 


58 


Qy 


61 


Db 


118 


Qy 


121 


Db 


178 


Qy 


181 


Db 


238 


Qy 


241 


Db 


298 


Qy 


301 


Db 


358 


Qy 


361 


Db 


418 


Qy 


421 


Db 


478 


Qy 


481 


Db 


538 


Qy 


541 



CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

IIIIIIIIIIIIIMIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIII III 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAACTG 177 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

llllllllllllllllllllllllll MIMIIIIII lllllllllll Mill II 
GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCTCC 237 



III 



TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

IMIIIII llllllllllllll II Mill II I Ml llllllllllllll III 

TACAACAGCACCCGCGACCGGGTGGCGGGGGAGAGCGCCGAGCCGGAGCCCGAGCCCGAG 357 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

IMIIIIIIMIIMIIIIIIIIIIIIIIIMIIIIIMIIMIII I MM Mill 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAACACCAACAAAATC 417 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill III Mill I M llllll IIIIIIMM IMIIIII II llllll 

TATGAGAAAGTCAAGAAAAGTCCGCACAGCATATATATGCTCTTCAACACATCAGAGCTC 477 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 48 0 

II Mill Mill II II II Mill II I II II I Ml I Mill Mill MM III 

CGAGAAGCAGTGCCTGAGCCCGTCTTGCTCTCCCGGGCAGAGTTGCGCCTGCTGAGGCTC 537 



MMIMIM IIIIIMM IIIMIM MMIMMM IMIMMMIMMIM 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 
lllllllllllllllllllllll Mill III I II II llllllllllllllllll 



Db 



598 



TACCTCAGCAACCGGCTGCTGGCGCCCAGCGACACGCCAGAATGGCTGTCCTTTGATGTC 



657 



Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

II Mill III llllllllllll II III II I Mill IIIIIIIIIIM 
Db 658 ACTGGAGTCGTGAGGCAGTGGCTGAGCCATGGAGGGGAAGTCGAGGGCTTTCGCCTCAGT 717 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

IMMII IIMI IMMIIIIIIMIMMIM II II II IMIMMMMI I 

Db 718 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTGCAAGTAGACATTAACGGGTTCAGT 777 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

II IMMII IIIIIIIIIIIIIIIIIIM lllllllllll IIIIIIIIIIIIIM 

Db 778 TCCAGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGACCCTTCCTGCTCCTC 837 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 84 0 

lllllllllll IIIMIIIIIIIIIIIIIIIMIIMIIillllll IIMI 

Db 838 ATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCAGCGCCG 890 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 90 0 

IIIIIIMI IIIIIIIIIMIMIIIMMIIIIIIIIIIII 

Db 891 GGCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 933 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

llllllll lllllllllllllllllllllll Mill IIIIMIIIIIIIllll III 

Db 934 TGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCAT 993 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II llllllll Mill II II IIIIIIIIIMIIIIII llllllll llllllll 

Db 994 GAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTG 1053 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

MM! IIIMMIIIIIIMMIM II II M I II M M M M M M II M II II M II 

Db 1054 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1113 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

IIIIIIIIIIMMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

Db 1114 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1173 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIMIIIMIIIIIIIIII IIIIMIMIIIII IIIIIIIIIMIIIIIII 

Db 1174 AAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1230 



RESULT 8 
AGMTGFB 

LOCUS AGMTGFB 1561 bp mRNA linear PRI 27-APR-1993 

DEFINITION Simian transforming growth factor-beta (TGF) mRNA, complete cds. 
ACCESSION M16658 
VERSION M16658.1 GI: 176552 

KEYWORDS growth factor; transforming growth factor-beta. 
SOURCE Cercopithecus aethiops (African green monkey) 

ORGANISM Cercopithecus aethiops 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 
Cercopithecinae; Cercopithecus. 
REFERENCE 1 (bases 1 to 1561) 
AUTHORS Sharples,K., Plowman,G.D., Rose,T.M., Twardzik,D.R. and 
Purchio,A.F. 
TITLE Cloning and sequence analysis of simian transforming growth 
factor-beta cDNA 
JOURNAL DNA 6 (3), 239-244 (1987) 
MEDLINE 87246074 
PUBMED 3474130 
COMMENT Original source text: African green monkey cells (cell line 
BSC-40), cDNA to mRNA, clone pTGF-beta-2. 

FEATURES Location/Qualifiers 

source 1..1561 
/organism="Cercopithecus aethiops" 
/mol_type="mRNA" 
/db_xref="taxon:9534" 

CDS 262..1434 
/note="transforming growth factor-beta precursor" 
/codon_start=1 
/protein_id="AAA35369.1" 
/db_xref="GI:176553" 
/translation="MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRK 
RIETIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMTVRSCKCS" 

sig_peptide 262..1095 
/note="transforming growth factor-beta signal peptide" 

mat_peptide 1096..1431 
/product="transforming growth factor-beta" 

BASE COUNT 301 a 547 c 446 g 267 t 
ORIGIN 



Query Match 79.9%; Score 956.6; DB 9; Length 1561; 

Best Local Similarity 88.5%; Pred. No. 2.6e-164; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 



Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III INI II lllllllllll lllllllllllll llllllllllllll II III 

Db 262 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 321 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

iiiiiiiii lllllllllllll Hill iiiiiiiiiiiiM iiiiiiiiiiiiiii 

Db 322 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 381 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

MIIIIMIIIIMIIIIIII MM MIIIIIMII IIIMMMM IMM Ml 

Db 382 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 441 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

IMIIIIIIIIMIIIIIII MIIIIIIIIIIIMMIII IMM II II II II 

Db 442 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 501 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

IIIIIMI IIIIIIIIIMIII llllllll MM II lllllllllll II III 

Db 502 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 561 



Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 



Db 562 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 621 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

lllll II iiMii I I I iiiiii Illlllll I Illlllll II mill 

Db 622 TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATC^^ 681 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II lllll II II Illlllll Illlllll llllllllllllll llllllllllll 

Db 682 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 741 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

IIIIMIIIIIIIIIIIIII Illlllll Illllllllllllllll llllllllll 

Db 742 AAGTTAAAAGTGGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 801 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

lllllllllllllllllllllll lllll Mil Illlllll! MM lllllllll 

Db 802 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 861 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIMMIIIMIII III MM III II II lllll IMIIIII II 

Db 862 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 921 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

lllllll MM lllllllllllllllllllll II lllll M llllllllll I 

Db 922 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 981 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I Mllllll llllllllllllll lllll IIIIIIIMIIIM Illlllll III 

Db 982 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC^ 1041 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

lllllllllll llllllllllllll II lllll llllllllllllllllll 

Db 1042 ATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1095 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIMIIM IIIIIIIIIIIIIIIIIMMMIMMMMI 

Db 1096 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1137 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIMIMIIIIIMMI II IIMMMMI MIMIM IIMIIMIMIII II 

Db 1138 TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1197 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

M IIIIIIIIMMMMMII IIMMIIIIillllMMIIMIM MIMIM 

Db 1198 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 1257 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

II Ml Ml MIMIM Mi MM li MMMMMMM MMiMMM Mini 

Db 1258 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1317 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

M MM Ml MMMMMMM I MM M MMMMMMM MMMMIMMMI 

Db 1318 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1377 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

MMIMMMIMMMMMMMMMMMMM MMMM IMMMM 



Db 1378 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1434 



RESULT 9 
BC022242 
LOCUS BC022242 1746 bp mRNA linear PRI 04-FEB-2002 

DEFINITION Homo sapiens, clone MGC:22008 IMAGE:4399762, mRNA, complete cds. 
ACCESSION BC022242 

VERSION BC022242.1 GI:18490115 

KEYWORDS MGC. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1746) 
AUTHORS Strausberg, R. 
TITLE Direct Submission 

JOURNAL Submitted (01-FEB-2002) National Institutes of Health, Mammalian 
Gene Collection (MGC), Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 
Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc.stanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 

Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, 
R. M. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 27 Row: e Column: 21 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis. 

FEATURES Location/Qualifiers 

source 1..1746 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="MGC:22008 IMAGE:4399762" 

/tissue_type="Duodenum, adenocarcinoma" 

/clone_lib="NIH_MGC_88" 

/lab_host="DH10B" 

/note="Vector: pCMV-SPORT6" 
CDS 370..1542 

/codon_start=1 

/product="Unknown (protein for MGC:22008)" 
/protein_id="AAH22242.1" 

/db_xref="GI:18490116" 

/translation="MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
PCLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 

BASE COUNT 376 a 612 c 472 g 286 t 

ORIGIN 

Query Match 79.9%; Score 956.6; DB 9; Length 1746; 

Best Local Similarity 88.5%; Pred. No. 2.6e-164; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 



Qy 1 

Db 370 

Qy 61 

Db 430 

Qy 121 

lllllll lllllllllll lllllllllll Mill III 

Db 490 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

Illlllllllllllllllll IIIIIIIIIIMIIIMMI Mill II II II II 

Db 550 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 609 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

llllllll llllllllllllll llllllll MM IIIIIIIIIIIIIIIM III 

Db 610 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 669 

Qy 301 

II IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII I MM 

Db 670 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II llllll I I llllll llllllll I llllllll II llllll 

Db 730 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACAC^ 789 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II llllllll M Mill II IIIIMII IIIMIIIIIIIII llllllllllll 

Db 790 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 849 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

IIIIIIIIIIIIIIMIIIIIIIIIIIII lllllllllllllllll llllllllll 

Db 850 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG^^^ 909 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIMIMIIIIIIIIIIIIII Mill Mill M IIMII I II lllllllll 

Db 910 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 969 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIIIIIIIMIIII Ml Ml III II II Mill llllllll II 

Db 970 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1029 



Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

IMIIII MM MMMM MIMMMM M MMI M MMMMM I 

Db 1030 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCT^ra^^ 1089 

Qy 721 

I MIMMI MMMMMMMMMM MMMMMMM MMMM IM 

Db 1090 

Qy 781 

M 

Db 1150 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

MMMM MMMM MMMMMMMMMMMM 

Db 1204 1245 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

MMMMMMMMI MMMMMMM MMMM MMMMMMM M 

Db 1246 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1305 

Qy 961 

Db 1306 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

MMI MMMMMMMMMM MMMMMMM MMMMMI MMU 

Db 1366 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1425 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

MMMMMMMMIMMMMMMMI M M M M M M M M M M M M M I 

Db 1426 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1485 

Qy 1141 

MMMMMMMMMMMMMI MMMI 

Db 1486 



RESULT 10 
BC000125 

LOCUS BC000125 1780 bp mRNA linear PRI 12-JUL-2001 

DEFINITION Homo sapiens, Similar to transforming growth factor, beta 1, clone 

MGC:3119 IMAGE:3351664, mRNA, complete cds. 
ACCESSION BC000125 

VERSION BC000125.1 GI:12652748 

KEYWORDS MGC. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1780) 
AUTHORS Strausberg, R. 
TITLE Direct Submission 

JOURNAL Submitted (03-NOV-2000) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 



Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

info@bcgsc.bc.ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis. 

FEATURES Location/Qualifiers 

source 1..1780 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="MGC:3119 IMAGE:3351664" 

/tissue_type="Eye, retinoblastoma" 

/clone_lib="NIH_MGC_16" 

/lab_host="DH10B-R" 

/note="Vector: pOTB7" 

CDS 447..1619 

/codon_start=1 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH00125.1" 
/db_xref="GI:12652749" 

/translation="MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 

BASE COUNT 341 a 648 c 493 g 298 t 
ORIGIN 



Query Match 79.9%; Score 956.6; DB 9; Length 1780; 

Best Local Similarity 88.5%; Pred. No. 2.6e-164; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Ml MM li IMMIIIIM Ml IIIIIIMI IIMIIIIMIIII M III 

Db 447 ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 506 



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

MMMMMMMMMMMI MMI MMMMMMM IMMMMMMM 



Db 507 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 566 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

IIIMIIIIIIIIIIIIIIIIIIIII lllllllllll lllllllllll Mill III 

Db 567 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 626 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

IIMIIIIIIIIIIIIIIII IIIIMIIIIIIIIIMIIl lllll II II II II 
Db 627 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 686 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

Mllllll IIIIIIMIIIMI llllllll MM IIIIIIMIIIIIIIII III 

Db 687 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 746 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIIIIIIIIIIMIillllllllllllllllilllllllllll I MM lllll 

Db 747 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 806 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

lllll II llllll I I llllll IIMIMI I IIIIIIM II llllll 

Db 807 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATC^^ 866 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II IIMMM M Mill II IIMIMI IIMMIIIIMM MIIIIMIIM 

Db 867 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 926 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

IIIIIIIIMIIIIIIIIIIIIIIIIIII lllllllllllllllll IIIIIMMI 

Db 927 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 986 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIMIIIIIIIIIIIIIIIII lllll lllll II llllll I II lllllllll 
Db 987 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1046 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

MMMIMIMMMMIII Ml Ml Ml M II Mill IIMIMI M 

Db 1047 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1106 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

lllllll MM Mllllll IIIIIMIIII II lllll II IIIIIMIII I 

Db 1107 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1166 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I IIMMM IIIIIMIMIIIIIIIIII llllllllllllll llllllll III 

Db 1167 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1226 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

MIIMIIMMIIIIIMMIMIMM Mill IMMMIIMMMIII 

Db 1227 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1280 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIIIIIM IIIMIII IIIIIIIIIIIIIMIIIIIIIII 

Db 1281 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1322 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIMIIIMMIIIII IIIIIMMIIMI llllllll llllllllllllll II 

Db 1323 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1382 



Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

M IMIIIIIIIIIIIIIMII IIIIIIM llllllll llllilll llllllll 
Db 1383 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1442 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIMIIIIMIMIIIIIII IIIIIIIIMIIII IIIMIIIIII Mini 

Db 1443 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1502 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

IIIIIIIMIIIIIIIMIIIIIIMIIIIII IIIIIIIIIMIIIIIIIIIIIIIMI 

Db 1503 GCGCCGTGCTGCGTGCCGG^GGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1562 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

llllllllllllllllllllllllllllllllllllll IIIIIIIIIIIIIIIIM 

Db 1563 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1619 



RESULT 11 

BC001180 
LOCUS BC001180 1780 bp mRNA linear PRI 12-JUL-2001 

DEFINITION Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:2323 IMAGE:3356605, mRNA, complete cds. 

ACCESSION BC001180 
VERSION BC001180.1 GI:12654682 
KEYWORDS MGC. 
SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1780) 
AUTHORS Strausberg, R. 
TITLE Direct Submission 
JOURNAL Submitted (11-DEC-2000) National Institutes of Health, Mammalian 
Gene Collection (MGC), Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 
BC Cancer Agency, Vancouver, BC, Canada 
info@bcgsc.bc.ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 10 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi:37097. 

FEATURES Location/Qualifiers 

source 1..1780 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="MGC:2323 IMAGE:3356605" 

/tissue_type="Eye, retinoblastoma" 

/clone_lib="NIH_MGC_16" 

/lab_host="DH10B-R" 

/note="Vector: pOTB7" 

CDS 447..1619 

/codon_start=1 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH01180.1" 
/db_xref="GI:12654683" 

/translation="MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVIALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 

BASE COUNT 341 a 648 c 493 g 298 t 
ORIGIN 



Query Match 79.9%; Score 956.6; DB 9; Length 1780; 

Best Local Similarity 88.5%; Pred. No. 2.6e-164; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III III! II IMIlllllll III lllllllll llllllllllllll II III 

Db 447 ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 506 



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

lllllllllllllllllllllll mil llllllllllllll lllllllllllllll 

Db 507 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 566 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

iiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiiiiii IMIlllllll mil III 

Db 567 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 626 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

miiiiiiimiiiim mmmimmim mii ii ii ii ii 

Db 627 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 686 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

miiiii iiiiimiiim mum im iimiiiiiiiiiiii iii 

Db 687 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 746 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

M iiiiiiimiiiiiiiiiNiiiiiiiiiiiiiiiiiiiiii I Mil mil 

Db 747 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 806 



Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II IIIIII I I IIIIII IIIIIIII I IIIIIIII II IIIIII 

Db 807 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATC^^ 866 



Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II IIIIIIII II mil M IIIIIIII IIIIIIIIIIIIII MIIMIIIIII 

Db 867 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 926 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

Illllllllllllllllllllllllllll Illllllllllllllll llllllllll 
Db 927 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 986 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

iiiiiiiiiiiiiiiiiiiiiii Hill mil II IIIIII I II iiiiiiiii 

Db 987 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1046 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

iiiiiiiiiiiiiiiiiiiii III III III 11 II mil iiiiim II 

Db 1047 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1106 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

lllllll Mil IIIIIMI IIIIMIIIII II Mill II IIIIMIIM I 

Db 1107 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1166 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I IIIIIIII IIIIIIIIIIMIIIIIIII IIIIIIIIIIIIII IIIIIIII III 
Db 1167 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1226 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

Illllllllllllllllllllllllllll Mill llllllllllllllllll 
Db 1227 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1280 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIIIIIII iiiiiiM iiiiiiiiiiimiiiiiiiiii 

Db 1281 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1322 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

lllllllimiMIM IIMMIIIIMII IIIIIIII IIIIIIIIIIIIII II 

Db 1323 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1382 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II immmMMMim imim miim immi iimm 

Db 1383 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1442 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IMIIIIIIIIIIMIIIII IMIIIIIIIIIM IIIIIIIIIM IIIIII 

Db 1443 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1502 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

III III I I III I III II III III III I i M i I M M Ml III II III III I III III II 

Db 1503 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1562 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

mmimmmmimmmimiMii mmmimmii 

Db 1563 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1619 



RESULT 12 



E03028 
LOCUS E03028 1821 bp RNA linear PAT 29-SEP-1997 

DEFINITION DNA encoding human prepro TGF-beta1. 

ACCESSION E03028 
VERSION E03028.1 GI:2171250 
KEYWORDS JP 1991180192-A/1. 
SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1821) 
AUTHORS Ohashi,H., Ishii,Y., Miyata,Y., Miyazono,K., Miyagawa,K. and 
Takaku,F. 

TITLE PRODUCTION OF HUMAN PRO-TGF-BETA1 BY GENETIC RECOMBINATION 
JOURNAL Patent: JP 1991180192-A 1 06-AUG-1991; 

KIRIN BREWERY CO LTD 

COMMENT OS Homo sapiens (human) 

PN JP 1991180192-A/1 

PD 05-AUG-1991 

PF 07-DEC-1989 JP 1989318243 

PI OHASHI HIDEYA, ISHII YASUYUKI, MIYATA YOSHINORI, PI MIYAZONO 
KOHEI, 

PI MIYAGAWA KIYOSHI, TAKAKU FUMIMARO 

PC C12P21/00,C07K13/00,C12N5/10,C12N15/18//C12Q1/68,(C12P21/00, 
PC C12R1:91), 
PC (C12N15/18,C12R1:91); 

CC strandedness: Double; 
CC topology: Linear; 
CC hypothetical: No; 
CC anti-sense: No; 
CC *source: tissue_type=placenta; 
CC *source: clone=pVC19-TGF; 

FH Key Location/Qualifiers 
FH 
FT 5'UTR 1..510 
FT CDS 511..1683 
FT /product='human prepro TGF-beta1' 
FT sig_peptide 511..597 
FT mat_peptide 598..1680 
FT /note='human pro TGF-beta1' 
FT mat_peptide 1345..1680 
FT /product='human TGF-beta1' 
FT 3'UTR 1684..1820. 

FEATURES Location/Qualifiers 

source 1..1821 

/organism="Homo sapiens" 
/mol_type="genomic RNA" 
/db_xref="taxon:9606" 

BASE COUNT 326 a 679 c 508 g 308 t 
ORIGIN 



Query Match 79.9%; Score 956.6; DB 6; Length 1821; 

Best Local Similarity 88.5%; Pred. No. 2.6e-164; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II lllllllllll lllllllllllll IIIIIIIIIIIIM M III 

Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

lllllllllllll lllllllll lllll llllllllllllll lllllllllllllll 
Db 572 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

llllllllllllllllllllllllll lllllllllll IMIIIIIIII lllll III 
Db 632 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 691 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

IIIIIIIIIIIIMIIIIII IIIMIIIIMIIMMIII lllll II M II II 

Db 692 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

llllllll llllllllllllll llllllll INI lllllllllllllllll III 
Db 752 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIIIIIIIIMIIIIMIIIIIIIIMIIIIIIIIIIIIIIII I nil Mill 

Db 812 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 871 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

lllll II llllll I I llllll llllllll I llllllll II llllll 
Db 872 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 931 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 48 0 

II llllllll II lllll M llllllll llllllllllllll llllllllllll 
Db 932 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 991 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

llllilllllllllllllllllllllMI lllllllllllllllll llllllllll 

Db 992 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1051 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 60 0 

Illllllllllllllllllllll lllll lllll II llllll I II lllllllll 
Db 1052 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1111 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

Illllllllllllllllllll III III III II II lllll llllllll II 
Db 1112 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1171 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

lllllll MM llllllll MIIMIIIII II Mill II llllllllll I 

Db 1172 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1231 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I IIIIIIM IIIIIIIIIIIIIIIIIIM llllllllllllll IIIIIMI Ml 

Db 1232 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1291 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

IIIIMIIMIIIIIIIIIIMIIIIIII lllll IIIIIIIIIIMIIIIII 

Db 1292 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1345 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IMIIIII IIIIIIM IIIIIIMIIIMIIIIIIIIMI 

Db 1346 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1387 



Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIIIIIIIIIIIMM IIIIIMMIIIII llllllll MIIIIIMIIIII II 

Db 1388 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1447 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II IIIIIIIIIIIIIIIMIII llllllll llllllll llllllll llllllll 

Db 1448 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1507 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill llllllllllllllllllll IIIIIIMIIIIII lllllllllll llllll 
Db 1508 GACACGCAGTACAGCAAGGTCCTGGCCCTGTAOU^CCAGCATAACCCGGGTO 1567 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

llllllllllllllllllllllllllllllll Mlllllllllllllllllllllllll 

Db 1568 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1627 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

llllllllllllllllllllllllllllllllllllll IIIIIMIIIIIIIIMI 
Db 1628 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684 



RESULT 13 

A06669 

LOCUS       A06669       2537 bp    mRNA    linear   PAT 29-JUL-1993
DEFINITION  Synthetic mRNA for preTGF-Beta1.
ACCESSION   A06669
VERSION     A06669.1  GI:412940
KEYWORDS
SOURCE      synthetic construct
  ORGANISM  synthetic construct
            artificial sequences.
REFERENCE   1  (bases 1 to 2537)
  AUTHORS
  TITLE     NUCLEIC ACID ENCODING TGF-beta 3 AND ITS USE
  JOURNAL   Patent: WO 8912101-A 4 14-DEC-1989;
FEATURES             Location/Qualifiers
     source          1..2537
                     /organism="synthetic construct"
                     /mol_type="mRNA"
                     /db_xref="taxon:32630"
     CDS             842..2014
                     /codon_start=1
                     /transl_table=11
                     /product="preTGF-beta1"
                     /protein_id="CAA00588.1"
                     /db_xref="GI:412941"
                     /translation="MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD
                     YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR
                     LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
BASE COUNT      473 a    893 c    739 g    432 t
ORIGIN



Query Match 79.9%; Score 956.6; DB 6; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 2.5e-164;

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy          1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
              ||| |||| || |||||||||||  ||||||||||||| |||||||||||||| || |||
Db        842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901

Qy         61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
              ||||||||||||| ||||||||| ||||| |||||||||||||| |||||||||||||||
Db        902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961

Qy        121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
              |||||||||||||||||||||||||| ||||||||||| ||||||||||| ||||| |||
Db        962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021

Qy        181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
              |||||||||||||||||||| |||||||||||||||||||| ||||| || || || ||
Db       1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081

Qy        241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
              |||||||| |||||||||||||| |||||||| ||||  ||||||||||||||||| |||
Db       1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141

Qy        301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
              || ||||||||||||||||||||||||||||||||||||||||||| |  |||| |||||
Db       1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy        361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
              ||||| || ||||||   |   | |||||| |||||||| | |||||||| || ||||||
Db       1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy        421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
              || |||||||| || ||||| || |||||||| |||||||||||||| ||||||||||||
Db       1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321

Qy        481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
              ||||||||||||||||||||||||||||| |||||||||||||||||  ||||||||||
Db       1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381

Qy        541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
              ||||||||||||||||||||||| ||||| ||||| || |||||| | || |||||||||
Db       1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441

Qy        601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
              ||||||||||||||||||||| ||| |||  ||| ||  || ||||| |||||||| ||
Db       1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501

Qy        661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
              |||||||  |||| ||||||||  ||||||||||| || ||||| || |||||||||| |
Db       1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561

Qy        721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
               | |||||||| |||||||||||||||||||| |||||||||||||| |||||||| |||
Db       1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621

Qy        781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
              ||||||||||||||||||||||||||||| ||||| ||||||||||||||||||
Db       1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA------ 1675

Qy        841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                                |||||||| |||||||| ||||||||||||||||||||||||
Db       1676 ------------------GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717

Qy        901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
              ||||||||||||||||| |||||||||||||| |||||||| |||||||||||||| ||
Db       1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777

Qy        961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
              || |||||||||||||||||||| |||||||| |||||||| |||||||| ||||||||
Db       1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837

Qy       1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
              ||||| |||||||||||||||||||| |||||||||||||| ||||||||||| ||||||
Db       1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy       1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
              |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db       1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957

Qy       1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
              |||||||||||||||||||||||||||||||||||||| ||||||||||||||||||
Db       1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014



RESULT 14 

E00973 

LOCUS       E00973       2527 bp    RNA     linear   PAT 29-SEP-1997
DEFINITION  cDNA encoding human TGF-beta.
ACCESSION   E00973
VERSION     E00973.1  GI:2169234
KEYWORDS    JP 1986219395-A/1.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2527)
  AUTHORS   Riku,M.A.D.D. and Debitsudo,B.G.
  TITLE     NUCLEIC ACID CODED WITH TGF-BETA AND ITS USE
  JOURNAL   Patent: JP 1986219395-A 1 29-SEP-1986;
            GENENTECH INC
COMMENT     OS   human
            PN   JP 1986219395-A/1
            PD   29-SEP-1986
            PF   20-MAR-1986 JP 1986064661
            PR   22-MAR-1985 US 85 715142
            PI   RIKU MAIKERU ANDORE DERINKU, DEBITSUDO BANNOOMAN GETSUDERU
            PC   C12P21/00,C12N1/00,C12N5/00,C12N15/00//C12Q1/68,(C12P21/00,
            PC   C12R1:91),
            PC   (C12N1/00,C12R1:19),(C12N5/00,C12R1:91),(C12N15/00,C12R1:91);
            CC   strandedness: Double;
            CC   topology: Linear;
            CC   hypothetical: No;
            CC   anti-sense: No;
            CC   *source: tissue_type=placenta and glyoblastoma;
            FH   Key             Location/Qualifiers
            FH
            FT   CDS             842..2014
            FT                   /product='pre TGF-beta'
            FT   mat_peptide     1676..2011
            FT                   /product='TGF-beta'
            FT   5'UTR           1..841
            FT   3'UTR           2015..2537
            FT   stem_loop       37..113
            FT   stem_loop       2015..2100
            FT   polyA_site      2514..2519.
FEATURES             Location/Qualifiers
     source          1..2527
                     /organism="Homo sapiens"
                     /mol_type="genomic RNA"
                     /db_xref="taxon:9606"
BASE COUNT      472 a    888 c    735 g    432 t
ORIGIN

Query Match 79.8%; Score 955; DB 6; Length 2527; 

Best Local Similarity 88.4%; Pred. No. 4.9e-164; 

Matches 1058; Conservative 0; Mismatches 115; Indels 24; Gaps 1; 
Qy          1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
              ||| |||| || |||||||||||  ||||||||||||| |||||||||||||| || |||
Db        842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901

Qy         61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
              ||||||||||||| ||||||||| ||||| |||||||||||||| |||||||||||||||
Db        902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961

Qy        121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
              |||||||||||||||||||||||||| ||||||||||| ||||||||||| ||||| |||
Db        962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021

Qy        181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
              |||||||||||||||||||| |||||||||||||||||||| ||||| || || || ||
Db       1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTC 1081

Qy        241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
              |||||||| |||||||||||||| |||||||| ||||  ||||||||||||||||| |||
Db       1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141

Qy        301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
              || ||||||||||||||||||||||||||||||||||||||||||| |  |||| |||||
Db       1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy        361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
              ||||| || ||||||   |   | |||||| |||||||| | |||||||| || ||||||
Db       1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy        421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
              || |||||||| || ||||| || |||||||| |||||||||||||| ||||||||||||
Db       1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321



Qy 



481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 



Db 


1322 


Qy 


541 


Db 


1382 


Qy 


601 


Db 


1442 


Qy 


661 


Db 


1502 


Qy 


721 


Db 


1562 


Qy 


781 


Db 


1622 


Qy 


841 


Db 


1676 


Qy 


901 


Db 


1718 


Qy 


961 


Db 


1778 


Qy 


1021 


Db 


1838 


Qy 


1081 


Db 


1898 


Qy 


1141 


Db 


1958 



IIIIIIIMIIIIMIIMIIMIIIIII Illllllllllllllll IIIIMIIIl 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGC^^ 1381 



IIIIIIIIIIIIIIIIMIllll lllll lllll II llllll I II lllllllll 



llllllllllll III III Ml II II lllll llllllll II 



GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll Mil llllllll lllllllllll II lllll II IIIIMIIIl I 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I llllllll IIIIIIIIIIIIIIIIMII llllllllllllll IIIIMII III 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 



IIIIIIIIIMIIIIIIIIIIIIIIIM lllll IMIIIIIIIIIIIIIII 



AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 9 0 0 

IIIIMII IIMIIII IIIIIIIIIIIIIIIIIIIIIMI 



1717 



TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IMIIIIMIIIIMM MIIIIIIIIMM IMIIIM MMIMIMMII II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II llllllllllllllllllll llllllll IMIIIM IIIMIM IMIIIM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIIIIIIIIIIIIIMIIM IIIIIIIIIIIIM lllllllllll llllll 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 18 97 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

IMMMMMIMMIIIIIMMMMMI M M I M M M M M M M M M M M 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957
AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIIMIII IIIMIIIIIIIIIIIIIIMI IIIIIIIMMIIIIIII 

AAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



RESULT 15 

BT007245 

LOCUS       BT007245     1173 bp    mRNA    linear   PRI 13-MAY-2003
DEFINITION  Homo sapiens transforming growth factor, beta 1 (Camurati-Engelmann
            disease) mRNA, complete cds.
ACCESSION   BT007245
VERSION     BT007245.1  GI:30583328
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1173)
  AUTHORS   Kalnine,N., Chen,X., Rolfs,A., Halleck,A., Hines,L., Eisenstein,S.,
            Koundinya,M., Raphael,J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y.,
            Phelan,M. and Farmer,A.
  TITLE     Cloning of human full-length CDSs in BD Creator (TM) System Donor
            vector
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1173)
  AUTHORS   Kalnine,N., Chen,X., Rolfs,A., Halleck,A., Hines,L., Eisenstein,S.,
            Koundinya,M., Raphael,J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y.,
            Phelan,M. and Farmer,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-MAY-2003) BD Biosciences Clontech, 1020 East Meadow
            Circle, Palo Alto, CA 94303, USA
COMMENT     This CDS clone is a part of a collection of human full length
            expression clones generated by BD Biosciences Clontech and the
            Harvard Institute of Proteomics. Each CDS has been cloned in two
            forms: with and without stop-codon (to allow fusion with C-terminal
            tag). The CDS has been directionally cloned using BD In-Fusion (TM)
            cloning system between the SalI and HindIII sites of the pDNR-DUAL
            vector. Additional sequences in the clone: 'ACC' after SalI site
            and before 'ATG' to provide Kozak consensus sequence; 'GG' after
            last codon and before HindIII site to maintain reading frame.
            Clone distribution: http://bioinfo.clontech.com/orfclones.
FEATURES             Location/Qualifiers
     source          1..1173
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="GH00247X1.0"
                     /clone_lib="BD Creator (TM) CDS Library derived from MGC
                     collection"
                     /lab_host="DH5alpha T1 resistant"
                     /note="Vector: pDNR-Dual"
     CDS             1..1173
                     /codon_start=1
                     /product="transforming growth factor, beta 1
                     (Camurati-Engelmann disease)"
                     /protein_id="AAP35909.1"
                     /db_xref="GI:30583329"
                     /translation="MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD
                     YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR
                     LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
BASE COUNT      238 a    380 c    352 g    203 t
ORIGIN

Query Match 79.7%; Score 954.6; DB 9; Length 1173; 

Best Local Similarity 88.5%; Pred. No. 6.3e-164; 

Matches 1057; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


wy 




Db 


721 


Qy 


781 


Db 


781 


Qy 


841 



ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III MM II lllllllllll III lllllllll IIMMIMIIIII II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 6 0 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

IIIIMIIIIIIIIIMIIIIM Mill llllllllllllll IIIIIIIIIIIIIII 

CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 120 



II 



AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

llllllllllllllllllll IMIIIIIIIIIIIIIIMI Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 240 



MMMII IIIIIIIIMIMI MIIIMI MM IIIIIIIIIIIIMIII III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II MIIIMIIMMMIIIIIIIMMMIIIIIIIIMIIIII I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll I I llllll llllllll I MIIIMI II IIIIM 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGC 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

M MMMII II Mill II MIIIMI llllllllllllll llllllllllll 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

IMIIIMIIIIIIMIIIIIIMIIIII IIIIIIIMMIIIIII IIIIIIIIM 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIIIIMIIIIIIIIIII Mill Mill II llllll I II lllllllll 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 600
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIIIMIIIIIIII III III Ml II II Mill llllllll M 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 660 



lllllll MM MIIIMI lllllllllll II Mill II IIIIIIMII I 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

I MIIIMI IIIIIIIMIIIIIIIIMI IIIIIIIIIMIII MIIIMI III 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 

MMMMMMIMMMIMIMMM Mill MIIMIIIMIIIIIM 



Qy        841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                                |||||||| |||||||| ||||||||||||||||||||||||
Db        835 ------------------GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 876

Qy        901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
              ||||||||||||||||| |||||||||||||| |||||||| |||||||||||||| ||
Db        877 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 936

Qy        961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
              || |||||||||||||||||||| |||||||| |||||||| |||||||| ||||||||
Db        937 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 996

Qy       1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
              ||||| |||||||||||||||||||| |||||||||||||| ||||||||||| ||||||
Db        997 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1056

Qy       1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
              |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db       1057 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1116

Qy       1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCT 1195
              |||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db       1117 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCT 1171



Search completed: October 27, 2003, 18:34:42 
Job time : 4678.52 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 359.617 Seconds 

(without alignments) 
8985.201 Million cell updates/sec 

Title: US-10-017-372E-8 
Perfect score: 1197 

Sequence: 1 atggcgccttcggggctgcg gttcctgcaagtgcagctga 1197 

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0

Searched: 2552756 seqs, 1349719017 residues 

Total number of hits satisfying chosen parameters: 5105512 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0%

Maximum Match 100% 
Listing first 45 summaries 
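
The run parameters above name a Smith-Waterman ("sw") model with an identity scoring table and affine gap costs (Gapop 10.0, Gapext 1.0). The following is a minimal Python sketch of that kind of scoring, not GenCore's actual implementation. Two things are assumptions rather than facts stated in the report: the mismatch weight of -0.6 (the contents of IDENTITY_NUC are not printed here) and the convention that a gap of length k costs gap_open + k * gap_ext. The function name is hypothetical.

```python
def sw_affine_score(q, d, match=1.0, mismatch=-0.6, gap_open=10.0, gap_ext=1.0):
    """Best local alignment score: Smith-Waterman with affine gaps (Gotoh recurrences).

    Assumed convention: a gap of length k costs gap_open + k * gap_ext.
    The mismatch weight is an assumption, not a value printed in the report.
    """
    neg_inf = float("-inf")
    n = len(d)
    h_prev = [0.0] * (n + 1)      # best score of an alignment ending at (i-1, j)
    f_prev = [neg_inf] * (n + 1)  # best score ending at (i-1, j) in a gap that skips q
    best = 0.0
    for i in range(1, len(q) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg_inf] * (n + 1)
        e = neg_inf               # best score ending at (i, j) in a gap that skips d
        for j in range(1, n + 1):
            # Open a new gap from H, or extend the gap already running.
            e = max(h_cur[j - 1] - gap_open - gap_ext, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open - gap_ext, f_prev[j] - gap_ext)
            s = match if q[i - 1] == d[j - 1] else mismatch
            # Local alignment: never let a cell drop below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


if __name__ == "__main__":
    # Toy case shaped like the hits in this report: the query carries a
    # 24-residue insertion that the database sequence lacks.
    query = "A" * 50 + "G" * 24 + "C" * 50
    subject = "A" * 50 + "C" * 50
    print(sw_affine_score(query, subject))
```

Under these assumed weights a single 24-residue gap costs 34, which is cheaper than 24 mismatches plus 24 lost matches, so the alignment bridges the insertion with one gap rather than a run of mismatches.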



Database 



N_Geneseq_19Jun03:*
1: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
2: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
3: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
4: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
5: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
6: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
7: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
8: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
9: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
10: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
11: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
12: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
13: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
14: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
15: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
16: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
17: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
18: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
19: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
20: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
21: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
22: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
23: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
24: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
25: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
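
The other summary fields printed with each result can be cross-checked against one another. The Python sketch below recomputes Score, Query Match and Best Local Similarity from the Matches, Mismatches, Indels and Gaps counts. The formulas are inferred from the figures in this report, not taken from GenCore documentation: a mismatch weight of -0.6 is an assumption that happens to reproduce the printed scores, and the function name is hypothetical.

```python
def summary_stats(matches, mismatches, indels, gaps, perfect_score,
                  mismatch_weight=-0.6, gap_open=10.0, gap_ext=1.0):
    """Recompute the per-result summary fields from the alignment counts.

    Assumptions (inferred, not documented in the report):
      - each match scores +1 and each mismatch scores mismatch_weight;
      - each gap costs gap_open, plus gap_ext per indel position;
      - Query Match is Score / Perfect score;
      - Best Local Similarity is Matches / (Matches + Mismatches + Indels).
    """
    aligned_columns = matches + mismatches + indels
    score = matches + mismatch_weight * mismatches - gaps * gap_open - indels * gap_ext
    return {
        "score": round(score, 1),
        "query_match_pct": round(100.0 * score / perfect_score, 1),
        "best_local_similarity_pct": round(100.0 * matches / aligned_columns, 1),
    }


if __name__ == "__main__":
    # RESULT 1 (AAD22696) is reported with Score 1135.8, Query Match 94.9%,
    # Best Local Similarity 97.8%; the perfect score for this query is 1197.
    print(summary_stats(matches=1171, mismatches=2, indels=24, gaps=1, perfect_score=1197))
```

If the recomputed figures disagree with a printed summary line, the more likely explanation is an OCR error in one of the counts than a different scoring scheme.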
ALIGNMENTS



RESULT 1 
AAD22696 

ID AAD22696 standard; cDNA; 1326 BP. 
XX 

AC AAD22696; 
XX 

DT 26-FEB-2002 (first entry)
XX 

DE Porcine transforming growth factor beta 1 (TGF-beta1) cDNA.
XX

KW Porcine; transforming growth factor beta 1; TGF-beta1; gene therapy;
KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive;
KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus;
KW diabetes mellitus; sarcoidosis; psoriasis; dermatological; ss.
XX

OS Sus scrofa.
XX 

FH Key Location/Qualifiers 
FT CDS 16..1188

FT /*tag= a

FT /product= "Porcine TGF-beta1 mutant protein"

XX 

PN WO200181404-A2.
XX

PD 01-NOV-2001.
XX

PF 20-APR-2001; 2001WO-US12980.
XX

PR 20-APR-2000; 2000US-199014P.
XX 



PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Claim 1; Fig 1; 78pp; English.
XX 

CC The invention relates to a composition containing a vector comprising a
CC gene encoding a regulatory transcription factor under the control of a
CC promoter encoding a transforming growth factor-beta (TGF-beta). The
CC vector is useful for expressing TGF-beta, such as TGF-beta1, TGF-beta2
CC or TGF-beta3, its variants or homologues, by transfecting a cell which
CC is part of a host suspected of having an autoimmune disease, especially
CC inflammatory bowel disease (IBD), under conditions such that the
CC polypeptide encoded by the nucleic acid sequence in the vector is
CC expressed. The vector is delivered using a delivery system. The delivery
CC of the vector results in substantial elimination of symptoms of the
CC autoimmune disease and increased production of IL-10 by the host. The
CC composition is useful for treating various diseases with an autoimmune
CC component such as multiple sclerosis, rheumatoid arthritis, systemic
CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis
CC and psoriasis, and also for assaying the expression of a gene in a cell.
CC The vector is further useful for screening of the effect of test
CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells.
CC The present sequence is a cDNA encoding porcine TGF-beta1 mutant.

XX 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 other; 

Query Match 94.9%; Score 1135.8; DB 24; Length 1326; 

Best Local Similarity 97.8%; Pred. No. 3.7e-231;

Matches 1171; Conservative 0; Mismatches 2; Indels 24; Gaps 1; 

Qy          1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         16 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 75

Qy         61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         76 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 135

Qy        121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db        136 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 195

Qy        181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
              ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db        196 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 255

Qy        241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        256 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 315

Qy        301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        316 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 375

Qy        361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        376 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 435

Qy        421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        436 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 495

Qy        481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        496 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 555

Qy        541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        556 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 615

Qy        601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        616 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 675

Qy        661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        676 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 735

Qy        721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        736 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 795

Qy        781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        796 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA------ 849

Qy        841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
                                ||||||||||||||||||||||||||||||||||||||||||
Db        850 ------------------GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 891

Qy        901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        892 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 951

Qy        961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        952 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1011

Qy       1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1012 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1071

Qy       1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1072 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1131

Qy       1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1132 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1188

RESULT 2 
AAI58342 

ID AAI58342 standard; cDNA; 2742 BP. 
XX 

AC AAI58342; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 545. 

XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 

KW leukaemia; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1. 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2000WO-US34263.
XX

PR 21-JAN-2000; 2000US-0488725.

PR 25-APR-2000; 2000US-0552317.

PR 09-JUL-2000; 2000US-0598042.

PR 19-JUL-2000; 2000US-0620312.

PR 03-AUG-2000; 2000US-0653450.

PR 14-SEP-2000; 2000US-0662191.

PR 19-OCT-2000; 2000US-0693036.

PR 29-NOV-2000; 2000US-0727344.
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C, Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR P-PSDB; AAM39186. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Claim 1; SEQ ID NO 545; 10078pp; English. 
XX 

CC The invention relates to human nucleic acids (AAI57798-AAI61369) and 

CC the encoded polypeptides (AAM38642-AAM42213) with nootropic,

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 



CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotactic/chemokinetic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 2742 BP; 526 A; 938 C; 799 G; 479 T; 0 Other; 

Query Match 80.1%; Score 958.2; DB 22; Length 2742;

Best Local Similarity 88.6%; Pred. No. 1.6e-193; 

Matches 1060; Conservative 0; Mismatches 113; Indels 24; Gaps 1; 

Qy          1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
              ||| |||| || |||||||||||  ||||||||||||| |||||||||||||| || |||
Db        842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901

Qy         61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
              ||||||||||||||||||||||| ||||| |||||||||||||| |||||||||||||||
Db        902 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961

Qy        121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
              |||||||||||||||||||||||||| ||||||||||| ||||||||||| ||||| |||
Db        962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021

Qy        181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
              |||||||||||||||||||| |||||||||||||||||||| ||||| || || || ||
Db       1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081

Qy        241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
              |||||||| |||||||||||||| |||||||| ||||  ||||||||||||||||| |||
Db       1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141

Qy        301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
              || ||||||||||||||||||||||||||||||||||||||||||| |  |||| |||||
Db       1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy        361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
              ||||| || ||||||   |   | |||||| |||||||| | |||||||| || ||||||
Db       1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy        421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
              || |||||||| || ||||| || |||||||| |||||||||||||| ||||||||||||
Db       1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321


Qy 


481 


Db 


1322 


Qy 


541 



lllllllllllllllllllllll Mill Mill II llllll I II lllllllll 



Db 


1382 


TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


1441 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 


Db 


1442 


lllllllllllllllllllll III III III II II Mill llllllll II 
ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


1501 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTT^ 

IMIIII MM IIMIIM IMIIMIIM II Mill II MIIIIIIM 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 


Db 


1502 


1561 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


780 


Db 


1562 


1 MIMIII IIIIIMIIIIIIIMMM IIIIIMIIIIIII IMIMII Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


1621 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 


840 


Db 


1622 


IIIIIIIIIIMIIIIMIIIIIIIIIII Mill IIIIIIIIIIMIIIMI 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 


1675 


Qy 


841 


AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 

IIIIIIM llllllll IIIIMIIIIIIIIIIIIIIIIII 

GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 


900 


Db 


1676 


1717 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

IMIIMIIIIIMIII llllllllllllll IIIIMII IIIMIIIIIIIII M 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1718 


1777 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

II IIIIIIIIIIIIMIIIIM IIMIIM IIIIMM llllllll MIMIII 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1020 


Db 


1778 


1837 


Qy 

UD 


1021 

1 Q '3 Q 


GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 

Mill IIIMIMMIIMIIIMI MIMMIIMIII IIIMIIIIII MUM 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1080 
1897 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1898 


IIIIIIIIMIIIIIIIIIIMIIIIIMIII lllllllllllllllllllllllllll 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 


1957 


Qy 
Db 


1141 
1958 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIMIIMIIIIIIIIIIIIIMIIMIIII IIIIMIIIIMIIIIM 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



RESULT 3 
AAQ20289

ID AAQ20289 standard; cDNA; 1559 BP. 

XX 

AC AAQ20289; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 

DE Sequence encoding simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator; 

KW ss.

XX 



Uo 






Y Y 






r n 


ivcy 


T .f^/^a t~ H r>T) 1 On a 1 H f* i ova 
i-iLJv_cl L. v^UdX i. i. J. CI. o 


r I 




o £r o o Q o 


r i 




/ Lag= a 


r 1 


Sly pepuiue 


TOT 7.0 A 


r J. 




/ ■* t- ;a o — H 

/ '-ay— u 


r i 




J Z D . . ± U -/ O 


r I 




/ " tag— c 


r i 


tudu pepciae 




r J. 




/ Ldy— U 


XX
PN WO9119513-A.
XX
PD 26-DEC-1991.
XX
PF 20-JUN-1991; 91WO-US04449.
XX
PR 20-JUN-1990; 90US-0541221.
XX
PA (BRIM ) BRISTOL-MYERS SQUIBB CO.
XX
PI Oleson FB, Comereski CR;
XX
DR WPI; 1992-024199/03.
DR P-PSDB; AAR20124.
XX
PT Use of transforming growth factor (TGF) -beta and their antagonists
PT - for modulating blood pressure, for treating hypertension
PT and hypotension
XX
PS Disclosure; Fig 1; 42pp; English.
XX
CC A new method for treating hypertension comprises administering a
CC transforming growth factor (TGF) -beta to an individual at a dose
CC effective for lowering blood pressure; the TGF-beta may be e.g.
CC mature TGF-beta, TGF-beta2, a mature TGF-beta1/beta2 hybrid, TGF-
CC beta1 precursor, a latent TGF-beta2 precursor, hybrid TGF-beta1/TGF-
CC beta2 precursor, a latent TGF-beta1 complex or a latent TGF-beta2
CC complex.
CC (Updated on 25-MAR-2003 to correct PA field.)
XX
SQ Sequence 1559 BP; 300 A; 546 C; 446 G; 267 T; 0 other;



Query Match 79.9%; Score 956.6; DB 13; Length 1559; 

Best Local Similarity 88.5%; Pred. No. 3.3e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M MIIIMMM IIIIIIMIIIII IIIIIIMIIIIII II III 

Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 320

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

Illllllll lllllllllllll lllll IMIIIIIIIIIII MMMIIIIIMM 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 380 



Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 MM MMMMMI 1 M M M M M MMI III
Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 440

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
IIIIIMIIIIIMIIMII IMIMIIIMIIIIMIII MMI II II II II
Db 441 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 500

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
II II M II llllllllllllll 1 II M 1 II MM II MMMMMI II III
Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
II IIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIMMIIIII 1 MM Mill
Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 620

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
Mill II llllll 1 1 1 Mini IIIMIM 1 llllllll II MUM
Db 621 TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 680

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
M Mill II II llllllll llllllll llllllllllllll MIIMIlllll
Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 740

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
Illllllllll llllllll llllllll lllllllllllllllll llllllllll
Db 741 AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 800

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
MIIIIIIIIIIIIIIIIIIMI Mill MM lllllllll MM lllllllll
Db 801 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
Illllllllllllllllllll III MM III II II Mill llllllll II
Db 861 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 920

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
lllllll MM IIIIIIIIIIIIIIIIIIIM II Mill II llllllllll 1
Db 921 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTC^ 980

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
1 llllllll MMMIMMIM Mill llllllllllllll IMIIIM III
Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1040



Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIMIIMIIIIIIIIIIIMIIIII II Mill IIIMIIIIIIIIIIIM 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIIIMII IIIIIIIIMMIIIMIIIIIMlllllllll 

Db 1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1136 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIMMMMIMIMI II MMMMMI IMIIIM IIIIIIIIIIIIM II 

Db 1137 TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1196

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II IIMIIIIIIIIIIIIIIII Illlllllllllllllllllllllll llllllll 



Db 1197 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 1256



Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080

Mill IIIIIIIIIIIIMIIIMI llllllllllllll lllllllllll mill

Db 1257 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1316

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

II II III II llllllllllllllllllll Mil II llllllll III III III III III II 

Db 1317 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1376 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMII llllllll lllllllll 

Db 1377 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1433 



RESULT 4
AAQ03268

ID AAQ03268 standard; DNA; 1561 BP.
XX
AC AAQ03268;
XX
DT 25-MAR-2003 (updated)
DT 12-AUG-1990 (first entry)
XX
DE Simian transforming growth factor-beta cDNA.
XX
KW Transforming growth factor-beta; psoriasis; TGF-beta; ss.
XX
OS Monkey.
XX
FH Key Location/Qualifiers
FT sig_peptide 283..324
FT /*tag= a
FT mat_peptide 1096..1431
FT /*tag= b
FT /product=human transforming growth factor-beta
XX
PN EP353772-A.
XX
PD 07-FEB-1990.
XX
PF 04-AUG-1989; 89EP-0114458.
XX
PR 05-AUG-1988; 88US-0229133.
XX
PA (ONCO ) ONCOGEN LP.
XX
PI Twardzik DR, Purchio AF, Ranchalis JE, Stevens V;
XX
DR WPI; 1990-038499/06.
DR P-PSDB; AAR03743.
XX
PT Inhibition of proliferation of epidermal cells -
PT used to treat psoriasis by contacting cells with compositions
PT containing transforming growth factor-beta.
XX
PS Disclosure; fig 1; 20pp; English.



XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis-categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03269 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 other;

Query Match 79.9%; Score 956.6; DB 11; Length 1561;
Best Local Similarity 88.5%; Pred. No. 3.3e-193;

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1;

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
III MM II MIIMMIII MIIIMIIIIII Mllllllllllll II Ml
Db 262 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 321

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
lllllllll Mlllllllllll Mill IIIIIMIIMIII IIIIIIIIMIIIII
Db 322 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 381

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
MMMMIMMMIMIM MM Mlllllllll MIIIIMIM lllll III
Db 382 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 441

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
IMIIIIIIIIIIIIIMII IIIIIIIIIIIIIIIIMII lllll II II II M
Db 442 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 501

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
IIMIIII MIIIIIIIIIIM llllllll MM II IIIMMIIII II III
Db 502

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
II MIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMII I llll Mill
Db 562 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 621

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
Mill II Mill! I I I llllll IIIMMI I IMMMI M MMII
Db 622 TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 681

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
II Mill II II IMMMI IMMMI llllllllllllll llllllllllll
Db 682 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 741

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
IMMIIIIIIIIIIIIIII IMMMI IIIIIIIIIIMMIII IIIMIIIII
Db 742 AAGTTAAAAGTGGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 801

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
MMIIMIIIIIIIIIIIIIM Mill MM lllllllll llll lllllllll
Db 802 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 861

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
MMMMMMMMIMM III MM III II M lllll llllllll II
Db 862 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 921

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
IIIMII MM MMMMMMMMMMI M MMI M MMMMM 1
Db 922 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTC^ 981

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
1 MMMM MMMMMMM MMI MMMMMMM MMMM Ml
Db 982 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1041

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
MMMMMI MMMIMMMI M IMM MMMMMMMMM
Db 1042 ATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1095

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
MMMM MMIIIIMMIIIIIIMIIIIIIIIIIIII
Db 1096 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1137

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
IMMMMIMIIIII M MIMIMIII MMMM IMMMIIIMII II
Db 1138 TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1197

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
M MMIMMMIMMMM M M 1 M M M M M M M II M M 1 IMMIM
Db 1198 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 1257

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
Mill IIMIIMIMIIIIIIIII MMMMMMM IIMIIIIIII MUM
Db 1258 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1317

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
III II III II MMMMM II MMMMMI IIMIIIIIIIIMIIIII MMMM
Db 1318 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1377

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
IMMIMIIIIIIIIIIMMMIMIMMIMIII MMMM IIIIIIIM
Db 1378 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1434



RESULT 5
AAQ13392

ID AAQ13392 standard; DNA; 1821 BP.
XX
AC AAQ13392;
XX
DT 20-NOV-1991 (first entry)
XX
DE Human pro-TGF-beta 1 gene.
XX
KW Osteogenetic; tumoricidal; ss.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT CDS 512..1684
FT /*tag= a
FT sig_peptide 512..598
FT /*tag= b
FT misc_RNA 599..1684



FT /*tag= c 

FT /note= "pro-TGF-beta 1" 

FT mat_peptide 1346..1684

FT /*tag= e 

FT /note= "TGF-beta 1" 

XX 

PN JP03180192-A. 
XX 

PD 06-AUG-1991. 
XX 

PF 07-DEC-1989; 89JP-0318243.
XX

PR 07-DEC-1989; 89JP-0318243.
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 
DR P-PSDB; AAR13813.
XX 

PT Human pro-TGF-beta 1 prodn., for osteo-genetic activity - by 
PT preparing DNA chain contg. base sequence coding for human 
PT pre : pro-TGF-beta 1, forming expression vector etc. 
XX 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The DNA sequence encodes human prepro-TGF-beta 1 which can be 
CC produced by recombinant methods, it has osteogenetic and 
CC tumoricidal activity. 
XX 

SQ Sequence 1821 BP; 326 A; 679 C; 508 G; 308 T; 0 other;
Query Match 79.9%; Score 956.6; DB 12; Length 1821; 
Best Local Similarity 88.5%; Pred. No. 3.3e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II IIMIMMM IIIIIMIMIII IIIIMIIMIMI II III 

Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

MIIIIIIIIMI IMIIIIII Mill IIMIIMIIMII MIIIIMIIIIIII 

Db 572 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

IIMIIIIMIMIIMIIIMMM IMIMIIIII MIIIIMIII Mill III 

Db 632 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTGGCC 691 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IIIIIMIIIIIIIMIMI IMIIIIIIMIIMIIMI Mill II M II II 

Db 692 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

IMIIIII IIIIIIMIIIIII IMIIIII MM MIMIIMIIIIIIII III 

Db 752 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IMIMIMIIIIMIIIIIMIIIIIIIMIIIIIIIIIMI I MM Mill 



Db 812 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 871 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Mill II mill I I iiiiii llllllll I llllllll II mill 

Db 872 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 931 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II llllllll M mil II iimiii llllllllllllll miiiiiiiii 

Db 932 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 991 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

Illllllllllllllllllllllllllll iiiiiiimimiii iiiiiiiiii 

Db 992 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1051 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

iiiiiiiiiimiiiiiiiiii mil mil ii mm i ii iiimiii 

Db 1052 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1111 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

iiiiiiiiiimiiiimi III III III II II mil iiiiiiii ii 

Db 1112 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1171 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

Illllll MM IIMIIII lllllllllll II Mill II IIIIIIIIII I 

Db 1172 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1231 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

I llllllll iiiiiiiiiiiiiiiimi mmMiMMi iiiiiiii mi 

Db 1232 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1291 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIIIIIIIIIIIIIIIIIIIIIMIIMI mil IMIIMIIIIIIIIIII 

Db 1292 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1345 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll llllllll llllllllllllllllllllllll 
Db 1346 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1387 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIIIIIIIIIIMIII Mllllllllllll MMMM IIIMIIIIIIIII II 

Db 1388 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1447 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II llllllllllllllllllll IIIIIMI IIIMIII IIIMIII llllllll 

Db 1448 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1507 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIIIMIIIIIIIIIIIIII llllllllllllll lllllllllll IIIIII 

Db 1508 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1567 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

MMMMMMMMMMMMMMMM IIIIIIIIIIMIIIIIIIIMIIIII 

Db 1568 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1627 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IMIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIII IIIMMIIMIIIIIII 

Db 1628 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684 



RESULT 6 
AAN60972 

ID AAN60972 standard; cDNA; 2537 BP. 
XX 

AC AAN60972; 
XX 

DT 31-OCT-2002 (updated) 

DT 28-OCT-1991 (first entry) 

XX 

DE Sequence encoding preTGF-beta. 
XX 

KW Transforming growth factor beta; cancer; wound healing. 
XX 

OS Unidentified.
XX 

FH Key Location/Qualifiers 
FT misc_structure 37..113
FT /*tag= a

FT /note= "Sequence can form stable hairpin loops"

FT CDS 842..2014

FT /*tag= b

FT mat_peptide 1676..2011

FT /*tag= c 

XX 

PN EP200341-A. 
XX 

PD 10-DEC-1986.
XX 

PF 21-MAR-1986; 86EP-0302112. 
XX 

PR 22-MAR-1985; 85US-0715142.
PR 13-MAR-1987; 87US-0025423.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA; 
XX 

DR WPI; 1986-326875/50. 
DR P-PSDB; AAP61468. 
XX 

PT TGF-beta prodn. from transformed hosts - useful esp. for treating 

PT wounds (J6 2/9/86) . 

XX 

PS Disclosure; Fig 1b; 26pp; English.
XX 

CC The gene product is known to stimulate cell proliferation and 
CC inhibit anchorage-dependent growth of a variety of human cancer cell
CC lines, it is esp. useful in treatment of burns and the promotion of 
CC surface and internal wound healing. TGF-beta may be expressed from a 
CC transformed CHO cell line. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 Other; 



Query Match 79.9%; Score 956.6; DB 7; Length 2537;



Best Local Similarity 88.5%; Pred. No. 3.5e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60
III MM II lllllllllll IIIIIIIIMIM IIIIIIIIIIIIM II III
Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120
lllllllllllll lllllllll lllll IIIIIIIIIIIMI IIIIIIMIIIIIII
Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180
llllllllllllllllllllllllll MMIIIIIII lllllllllll lllll III
Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
llllllllllllllllllll llllllllllllllllllll lllll II II II II
Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
IMIIIII Mllllllllllll llllllll MM lllllllllllllllll III
Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
II IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMI I MM Mill
Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
lllll II llllll I I Mini llllllll I llllllll II MUM
Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
II llllllll II Mill M llllllll llllllllllllll llllllllllll
Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
IIIIIIIIIIIIIIIIMIIIIIIIMII IIIMMIIIIIIIIM IMMIIIII
Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
IIIIIIIIIIIIIIIIIIMIII Mill lllll II llllll I II lllllllll
Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
Illllll MM llllllll MMIIIIIII II lllll II llllllllll I
Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC^ 1561

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
I IIIIMII llllllllllllllllllll IIIMIIIMIIII llllllll III
Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
IIIMIIIIIIIMIIIIIIMIIIIIII Mill IIMIIIIIIMIIIIII
Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675



Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll llllllll IIIIIIMIIIIIIIIIIIIIIII 
Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIIIIIIIIMIIIII IIIIIMIIIIIII llllllll IIIIIMIIIIIII II 
Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

M IIIIMIIIIIIIIMIIII llllllll Mllllll llllllll IIIIIMI 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080

Mill IIMIIIIIIIIIIMIIII IMIIIIIIIIIII IIMIIIIIII llllll

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IMIIMIIIIIIIIIIIIIIIMIIIIIIII IIMIIMIIIIIIIIIMIIIIIIII

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII llllllllllllllllll 
Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 

RESULT 7 
AAQ03301 

ID AAQ03301 standard; DNA; 2537 BP. 

XX 

AC AAQ03301; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-AUG-1990 (first entry) 

XX 

DE cDNA encoding human pre-transforming growth factor-beta-1 (pre-TGF-beta-

DE 1).
XX 

KW Transforming growth factor-beta-1 (TGF-beta-1);

KW neoplastic cell line inhibition;

KW EGF-potentiated anchorage-independent growth;

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842..2014

FT /*tag= a

FT mat_peptide 1676..2011

FT /*tag= b

FT misc_difference 37..113

FT /*tag= c

FT /note= "stable hairpin loops"

FT misc_feature 2015..2100

FT /*tag= d

FT /note= "G-C rich sequence

FT and a downstream TATA-like sequence"



XX 

PN US4886747-A. 
XX 

PD 12-DEC-1989. 
XX 

PF 13-MAR-1987; 87US-0025423.
XX

PR 13-MAR-1987; 87US-0025423.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 

XX 

DR WPI; 1990-051338/07. 

DR P-PSDB; AAR05258.
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 
XX 

PS Disclosure; Fig 1b; 28pp; English.
XX 

CC It was obtained by an analysis of several overlapping cDNAs and gene 

CC fragments, leading to the detn. of a continuous sequence corresp. to the 

CC TGF-beta-1 precursor mRNA. It is useful in constructing vectors that 

CC encode biologically active transforming growth factor (TGF-beta) , 

CC operably linked to DNA that encodes a secretory leader (SL) . It, or a 

CC nucleic acid capable of hybridising with it, can also be labelled and 

CC used in diagnostic assays for DNA or mRNA encoding TGF-beta or related 

CC proteins. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 

Query Match 79.9%; Score 956.6; DB 11; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 3.5e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M IIIIIIIIIM IIIIIIMIIIII IIIIIIIIIMIII II III 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

MiiliilllM! Illllllll IIIM IIIIIIIIIMIII lllllllllllllll 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180

IMMMIMIMIMIIIIMMM IIIIMMIII IIIIIIIMII Mill Ml 

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

MMMMMMMMMM llllllllllllllllllll Mill II II II II 

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

Illlllll IIIIIIIIMMII MMIMI MM lllllllllllllllll III 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 



Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII I Mil Mill 

Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II IIMII I I llllll llllllll I llllllll II llllll 

Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

II llllllll II Mill II IMIIIII MIIIIMIIIIII IIIIIIIMIII 

Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

IIIIIIMIIIIIIIMIIIIIIIIIIII lllllllllllllllll llllllllll 

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IMIMIIIIIIIIIIIIIIIII Mill Mill II Mini I II lllllllll 

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIMIIIIIIIIIIII III III III II II Mill llllllll II 

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll MM IIIIIIM IIIIIMIIII II Mill II IIIIMIIII I 

Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

I llllllll llllllllllllllllllll IIIMIMIIIIM llllllll III 

Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIIIIMIMIIIIMIIIIIIIMIIII Mill IIIIIMIIIMIIMII 

Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900

IMIIIII llllllll IIIIIIIIIIIIIMIIIIMMI 

Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

MIMMMIMIMM IIIIIMIIIIIII llllllll IIIIIIIIIIMM II 

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II IIIIIIIMMIMIIIIM llllllll llllllll IMIIIII IIIIIMI 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIIIIIIIIIIIMIIIIM IIIIIIIIIIIIM IMIIIIMM llllll 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

llllllllllllllllllllllllllllllll lllllllllllllllllllllllllll 

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957



Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII llllllllllllllllll 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



RESULT 8
AAQ02814

ID AAQ02814 standard; cDNA; 2537 BP.
XX
AC AAQ02814;
XX
DT 25-MAR-2003 (updated)
DT 31-OCT-2002 (updated)
DT 31-MAY-1989 (first entry)
XX
DE Sequence of pre-TGF-beta 1 cDNA.
XX
KW Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth
KW inhibition.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT CDS 842..2011
FT /*tag= a
FT /label=pre-TGF beta 1
FT CDS 1677..2011
FT /*tag= b
FT /label=mature TGF-beta 1
FT GC_signal 2015..2092
FT /*tag= c
FT misc_feature 2093..2099
FT /*tag= d
FT /label=TATA-like sequence
FT stem_loop 37..113
FT /*tag= e
FT misc_feature 863..911
FT /*tag= f
FT /label=hydrophobic domain
XX
PN WO8912101-A.
XX
PD 14-DEC-1989.
XX
PF 08-JUN-1988; 88WO-US01945.
XX
PR 08-JUN-1988; 88WO-US01945.
XX
PA (GETH ) GENENTECH INC.
XX
PI Dernyck RMA, Goeddel DV;
XX
DR WPI; 1990-007474/01.
DR P-PSDB; AAR04034.
XX
PT Nucleotide sequence encoding transforming growth factor beta-3 - used as a
PT probe, or to produce TGF beta 3, for growth inhibition of certain normal
PT and neoplastic cells, eg A549.
XX 

PS Disclosure; Fig. lb; 61pp; English. 
XX 

CC Sequence encodes the 390 amino acid (AA) precursor transforming growth 

CC factor-beta 1 (pre-TGF-beta 1) polypeptide. The 5' untranslated region of 

CC the TGF-beta 1 mRNA is 841 bases long, is purine rich and has a region of 

CC potential secondary structure. The TATA-like sequence in the 3' untrans- 

CC lated region of the gene is presumably a polyadenylation signal. Mature 

CC TGF-beta 1 comprises the C-terminal 112 AA's of pre-TGF-beta 1 and is 

CC cleaved at the Arg-Arg dipeptide preceding its NH2 terminus. The nucleic 

CC acid encoding the second subtype of TGF-beta (TGF-beta 3) is useful as a 

CC probe or to produce TGF-beta 3 for inhibition of growth of normal and 

CC neoplastic cells. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 Other; 

Query Match 79.9%; Score 956.6; DB 11; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 3.5e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60

Db

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

Db

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180

Db

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

Db

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

Db

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360

Db

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Db

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

Db

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

lllllllllllllllllllllll Mill Mill II llllll I II lllllllll 

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIIIIMIIIIIII III III III II II Mill llllllll II 

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

MMIII MM IIMIIII lllllllllll II Mill M llllllllll I 

Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

I IIIIMII llllllllllllllllllll llllllllllllll IIMIIII III 
Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIIIIIIIIIIIIIMIIIIIIIMIIII Mill IIIMIIIIIIIIMIII 

Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll IIMIIII IIIIIIIIIIIIIIIMMIIIII 

Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIMIIIIIIIMIM llllllllllllll llllllll llllllllllllll M 

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II IIIIIIIIIIMIIIMIII llllllll llllllll IIMIIII Mllllll 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080

Mill IIIIIIIIIIIIIIMMII IIIIMIIIIIIII IIIIIMIIII llllll 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IIIIIIIIIIMMIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIMIIIII 

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIMIIIIIIMIIIIIMIIIIIIIIIMIIII Mllllllllllllllll 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014
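
The summary figures reported for this hit can be cross-checked with a short sketch. It assumes (an inference from the reported numbers, not something the report states) that Best Local Similarity is the number of matches divided by the total number of aligned columns, that is, matches plus mismatches plus indels. The function name is hypothetical.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percentage of aligned columns that are exact matches (assumed definition)."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts as reported for the AAQ02814 hit: Matches 1059; Mismatches 114; Indels 24.
similarity = best_local_similarity(matches=1059, mismatches=114, indels=24)
print(f"{similarity:.1f}%")  # 88.5%
```

Under this assumption the aligned-column total is 1197, which equals the last query coordinate shown in the alignment, and the 24 indels correspond to the single gap in the Db rows.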

RESULT 9 

AAT15720 

ID AAT15720 standard; cDNA; 2537 BP. 
XX 

AC AAT15720; 
XX 

DT 25-MAR-2003 (updated) 
DT 24-JUL-1997 (revised) 
DT 25-JAN-1980 (first entry) 



XX 

DE Pre-transforming growth factor beta 1 cDNA. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production; ss. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers
FT 5'UTR 1..841
FT /*tag= a
FT misc_feature 37..113
FT /*tag= b
FT /note= "GC-rich region forms stable hairpin loops;
FT similar to structural organisation of c-myc RNA,
FT could play role in mRNA stability or in
FT regulation of transcription"
FT CDS 842..2014
FT /*tag= c
FT /product= pre-TGF_beta_1
FT mat_peptide 1676..2011
FT /*tag= d
FT /product= mature_TGF_beta_1
FT repeat_region 2015..2100
FT /*tag= e
FT /note= "GC-rich region; possibly responsible for the
FT fact 3'UTR of mRNA could not be cloned as cDNA;
FT may be important for transcription efficiency"
FT repeat_unit 2019..2023
FT /*tag= f
FT TATA_signal 2094..2100
FT /*tag= g
FT /note= "TATA-like sequence; no evidence that this
FT functions as a promoter"
FT polyA_signal 2514..2520
FT /*tag= h
FT misc_signal 2529..2536
FT /*tag= i
FT /note= "consensus sequence immediately precedes
FT polyA-tail (Benoist et al)"

XX 

PN US5482851-A. 
XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US-0147364.
XX
PR 13-MAR-1987; 87US-0025423.
PR 22-MAR-1985; 85US-0715142.
PR 04-AUG-1989; 89US-0389929.
PR 04-MAR-1992; 92US-0845893.
PR 05-NOV-1993; 93US-0147364.
XX
PA (GETH ) GENENTECH INC.
XX
PI Derynck RMA, Goeddel DV;
XX
DR WPI; 1996-076891/08.
DR P-PSDB; AAR90827.
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 

PT using Chinese hamster ovary cells, for use in diagnostic applications 

PT or in therapy 
XX 

PS Example 3; Fig 1; 26pp; English. 
XX 

CC The cDNA encodes the pre-transforming growth factor (TGF) beta 1 protein.

CC The nucleotide sequence was obtd. by an analysis of several overlapping

CC cDNAs and gene fragments. The DNA is useful for the recombinant 

CC production of TGF beta 1, which can be used in, e.g. wound healing. 

CC (Revised entry submitted to correct sequence analysis breakdown.) 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 79.9%; Score 956.6; DB 17; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 3.5e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II lllllllllll lllllllllllll llllllllllllll II III 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

lllllllllllll iiiiiiiii mil iiMiiiiiiiiii iiiiiiiiiMiiii 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

iiiiiiiiiiiiiiiiiiiiiiiiii lllllllllll lllllllllll mil III 

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

llllllllllllllllllll llllllllllllllllllll Mill II II II II 
Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

miiiii iiiiimimii iiiiiiii im iiiiiiiimiiiiii iii 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II iiiiiiiiiimiiiiiiimiiiiiiiiiiiiiiiiiiii I MM mil 

Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Mill II mm I I mm miiiii i iiiiiiii ii mm 

Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

II IIIIIIII M Mill 11 IIIIIIII llllllllllllll IIIIMIIIIII 
Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

IIIIIIIIIIMIIIIIIIIIIIIIIIII lllllllllllllllll IIIIIIIIM 

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381 



Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIMIIIIIMIIIIIII Mill Mill II llllll I II IIIMMII 

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIMIIIIIMIIMI III Ml III II M Mill IIIIIIM II 

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

IIMIII MM llllllll lllllllllll II Mill II IIMIIIIM I 

Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I llllllll IIIIIIIIIIIMIIIIIII IIIIIIMIMIII llllllll III 

Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

Illllllllllllllllllllllllllll Mill llllllllllllllllll 
Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIMIIM IIIIIIM IIIIIIIMIIIMIIIIIIIIM 

Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

lllllllllllllllll llllllllllllll llllllll IIIIIIMIIMII II 
Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II IIIIIIIIIIIIIIIIMII IIIIIIM llllllll IMIIMI llllllll 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIIIIIIMIIIIIIIIIII IIIIIMIIIIIII IIIIMIIIII llllll 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IMIIMIMMIIIIIIIIIIIIIIMMII IMIIIIIIMIIIIIIIIIIMIIM 

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIII IIIIIIIMIIIIIIIII 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



RESULT 10 
ABQ76674 

ID ABQ76674 standard; DNA; 2527 BP. 
XX 

AC ABQ76674; 
XX 

DT 26-MAR-2003 (first entry) 
XX 

DE Androgen receptor signalling pathway-associated DNA E00973. 
XX 

KW Androgen receptor; transactivation; modulator; Smad3; Smad4; Akt; TGF-B;



KW signal transduction pathway; transforming growth factor-B; phosphatase; 

KW tensin; cytostatic; antiproliferative; cellular proliferation; cancer; 

KW E00973; ds.
XX 

OS Synthetic. 
XX 

PN WO200282081-A2. 
XX 

PD 17-OCT-2002. 
XX 

PF 05-APR-2002; 2002WO-US11086.
XX
PR 06-APR-2001; 2001US-282266P.
PR 13-MAR-2002; 2002US-365060P.
XX 

PA (UYRP ) UNIV ROCHESTER. 
XX 

PI Chang C; 
XX 

DR WPI; 2003-046871/04. 
XX 

PT Modulating androgen receptor activity, by administering a compound that 

PT modulates receptor activity, inhibits receptor-signal transduction 

PT pathway/receptor-coactivator interaction or changes amount or receptor 

PT 

XX 

PS Disclosure; Page 225-226; 302pp; English. 
XX 

CC This invention describes a novel method for modulating androgen receptor 

CC activity or androgen receptor-mediated transactivation activity in a 

CC cell. The method involves administering a compound which causes 

CC modulation of the androgen receptor's activity and the inhibition of

CC interaction between the receptor and a protein involved in a signal 

CC transduction pathway. The compound also inhibits the interaction between 

CC the androgen receptor and a protein selected from Smad3, Smad4, Akt, 

CC transforming growth factor (TGF) -B and phosphatase and tensin homologues 

CC deleted on chromosome 10 (PTEN) or their fragments. The compounds of the 

CC invention have cytostatic and antiproliferative activity. The obtained 

CC composition is useful for treating any disease, where uncontrolled 

CC proliferation or cellular proliferation occurs such as cancer, e.g. 

CC prostate cancer. This sequence represents the androgen receptor 

CC transactivation signalling pathway modulator E00973 described in 

CC the method of the invention. 

XX 

SQ Sequence 2527 BP; 472 A; 888 C; 735 G; 432 T; 0 other; 



Query Match 79.8%; Score 955; DB 25; Length 2527; 

Best Local Similarity 88.4%; Pred. No. 7.6e-193; 

Matches 1058; Conservative 0; Mismatches 115; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III Mil II IIMIIIIIII lllllllllllll llllllllllllll II III 
Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

lllllllllllll II Mil III Mill IMMIIIIIIIII II II I II Mill III 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 



Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180

IIIIIIIIIMIIIIIIIIIIIIIM IIIIMIIIII lllllllllll Mill III

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

llllllllllllllllllll llllllllllllllllllll Mill II II II II

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTC 1081

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

IIIIMII IIIIIIIIMIIM MMMM MM IIIIIIMIIIIIIIII III

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360

II IIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 1 MM Mill

Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Mill 11 MUM 1 1 IIMII MMMM 1 llllllll 11 Mill!

Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

II IMIIIII II Mill II MMMM IIIIIIIIIIIIM IIIIIIIMIII

Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

MIIMIIIIIMIMIIIIIIIIIIIII MIIIIIIIMIIIIII MMMMM

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600

Mlllllllllllllllllllll Mill Mill II MUM 1 II IIIIIIIM

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660

IIIIIIIIIIIIIIIIMIII Ml III III II II Mill IIIIMII II

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

IIMII! MM IMIIIII IIMIIIIIII II Mill M MMMMM 1

Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

1 IIIIMII IIMIIMIIIIIIIIIIII llllllllllllll llllllll III

Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

Illllllllllllllllllllllllllll Mill llllllllllllllllll

Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900

llllllll llllllll IIIIIIIIIIIIIIIIIIIIMII

Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960

IIIIIIMIIIIIIIII IIMIIIIIIIIII llllllll llllllllllllll II

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II IIIIIIIIIIIIIIIIMII IIIIIIM llllllll IMIIMI llllllll

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080

Mill IIIIIIMMIIIIIIIIII IIIIIIIIIIIIII IIIIIIIIIII IIIIII

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIII IIIIIIIIIIIIIIIIIMIIIIIIII

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197

Illllllllllllll IIIMIIIIIIIIIIIIIIIII IIIIIMIIIIIIIIMI

Db 1958 AAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014



RESULT 11
AAQ56923

ID AAQ56923 standard; cDNA; 2537 BP.
XX
AC AAQ56923;
XX
DT 25-MAR-2003 (updated)
DT 09-JUL-1994 (first entry)
XX
DE Human pre-TGF-beta-1.
XX
KW TGF-beta-1; TGF-beta-2; transforming growth factor beta-1;
KW transforming growth factor beta-3; recombinant; wound healing;
KW vulnerary; ss.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT misc_structure 47..113
FT /*tag= a
FT /note= "possible hairpin loop region"
FT CDS 842..2014
FT /*tag= b
FT mat_peptide 1676..2011
FT /*tag= c
FT polyA_signal 2515..2521
FT /*tag= d
XX
PN US5284763-A.
XX
PD 08-FEB-1994.
XX
PF 04-MAR-1992; 92US-0845893.
XX
PR 22-MAR-1985; 85US-0715142.
PR 13-MAR-1987; 87US-0025423.
PR 04-AUG-1989; 89US-0389929.
PR 04-MAR-1992; 92US-0845893.
XX
PA (GETH ) GENENTECH INC.
XX
PI Derynck RMA, Goeddel DV;
XX 

DR WPI; 1994-056343/07.

DR P-PSDB; AAR46227. 
XX 

PT Nucleic acid sequences encoding transforming growth factor-beta - 

PT diagnostic probes, and for use in therapeutics 

XX 

PS Disclosure; Fig 1b; 25pp; English.
XX 

CC cDNA sequences were determined for human pre-TGF-beta-1 (AAQ56923),
CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926), and the
CC corresponding amino acid sequences were determined (AAR46227-29,
CC respectively). A genomic fragment corresponding to a human TGF-
CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence
CC determined (AAR46230). The sequences have been used in the
CC construction of vectors for the expression of recombinant TGF-
CC beta.

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 890 C; 742 G; 432 T; 0 Other; 

Query Match 79.8%; Score 955; DB 15; Length 2537;

Best Local Similarity 88.4%; Pred. No. 7.7e-193; 

Matches 1058; Conservative 0; Mismatches 115; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II IIIIMMIM IIIIIIIIIMII IIMIIIIIIIIII II III 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

IIMIIIIIIMI IIIIIIIM Mill llllllllllllll lllllllllllllll 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

IIIIIIIIIMIIIIMIIIIMIII lllllllllll MIIIIIIMI Mill III 

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IMIMIIIMIIIIMIII MIIIMIIIIIIIMIIII Mill II II II II 

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

IMIIIII MIIIIMIIIIM IIIMMI MM MM llllllllllll III 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACGGGAGCCCGAGCCTGAG 1141 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360

M IMMMIMIIMIIIIIIIMIMMMIIIIMMIIIII I MM Mill 

Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

MM! II MIMI I I MMII MIIMM I MMIIII M MMM 

Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

M IIMIMI M IMM II MMIIII IIIIIIIIIIMII llllllllllll 



Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321



Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

IIIIIIMIIIIIIIIIIIIIIIIIIIII Illllllllllllllll llllllllll 

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIIIIIIIIIIIIIIIM lllll lllll II llllll I II IIIIIIIM 

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIIIIMMIIIII III III III II II lllll llllllll II 

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll Nil lillllM llllllllll] II lllll II llllllllll I 

Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

I llllllll llllllllllllllllllll IIIIMIIIIIIII llllllll III 

Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

lllllllllllllllllllllllllllll Mill llllllllllllllllll 

Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll llllllll lllllllllllMIIIIIIIIIII 

Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIIIIMIIIIIIIII IIIMIIMMIII llllllll llllllllllllll II 

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020

II llllllllllllllllllll llllllll llllllll llllllll llllllll 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

lllll IIIIIIIIIIIMIIIIIII llllllllllllll IIMIIIIIII llllll 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IIIIIIIIIIIIIIIIMIIIIIIIIIIIIII IIIIIMIIIIIIIIIIIIIIIIIIII 

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIII llllllllllllllllll 
Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



RESULT 12 
AAN81084 

ID AAN81084 standard; cDNA; 1560 BP. 
XX 

AC AAN81084; 
XX 



DT 25-MAR-2003 (updated) 

DT 09-OCT-1990 (first entry) 

XX 

DE Coding sequence of simian transforming growth factor-beta 1.
XX 

KW Transforming growth factor-beta 1; tumour treatment; ss cDNA. 
XX 

OS Cercopithecus aethiops . 
XX 

FH Key Location/Qualifiers
FT CDS 261..1433
FT /*tag= a
FT sig_peptide 282..323
FT /*tag= b
FT mat_peptide 1095..1433
FT /*tag= c

XX 

PN EP293785-A. 

XX 

PD 07-DEC-1988. 
XX 

PF 27-MAY-1988; 88EP-0108528.
XX
PR 29-MAY-1987; 87US-0055662.
PR 25-JAN-1988; 88US-0147842.
XX 

PA (ONCO ) ONCOGEN. 

PA (BRIM ) BRISTOL-MYERS CO. 

XX 

PI Purchio AG, Gentry L, Twardzik D; 
XX 

DR WPI; 1988-347488/49. 

DR P-PSDB; AAP80647. 
XX 

PT Prodn. of simian transforming growth factor beta-1 - by culturing 

PT transfected eucaryotic cells, and new precursor proteins, useful for 

PT treating tumours.
XX 

PS Disclosure; Page ?; pp; English. 
XX 

CC The cDNA is prepd. from African green monkey cell line BSC-40 and is 

CC expressed in eukaryotic cells in plasmid pSV2. There is 100% homology

CC between mature simian and human TGF-beta 1. The plasmid also contains 

CC the SV40 promoter and a selection marker, esp. DHFR. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 

XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 Other; 

Query Match 79.6%; Score 953.4; DB 9; Length 1560; 
Best Local Similarity 88.3%; Pred. No. 1.6e-192; 

Matches 1057; Conservative 0; Mismatches 116; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M MMMIMII MMMMMIM MMMMMMM II III 

Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 320



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

lllllllll lllllllllllll Mill IIIIIIMMIMI Mllllll llllll 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 380

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

lllllllllllllllllllll MM IIIMMMM Mlllllllll Mill III 

Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 440

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IIIMIIMIMIIMIIM MMMMIIIIMMMII IIIM II II M II 

Db 441 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 500

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

IIIMMI IIIIIIIIIMIM IIIMMI MM M MIIIIIIIM II III 

Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIMIIIIIIMIIIIIIIIIIIIIIMIMIIIMMMIII I MM Mill 

Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 620

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Mill II IIMII I I I llllll Mllllll I IIIIIIM II llllll 
Db 621 TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 680

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

II Mill II M IIMIIII IIMIIII IIIIMIIIIIMI llllllllllll 

Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 740

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

lllllllllll IIIIIMI IIMIIII lllllllllllllllll IMMIIIII 
Db 741 AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 800 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600

IIIIIIIIIIMIMMIIIIII Mill MM MMIIIII MM IIIIIIMI 

Db 801 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

lllllllllllllllllllll III MM 111 II II Mill Mllllll II 

Db 861 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 920

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

IIIIMI MM IIIIIIIIMIIIIIIIIIII II IIIM II IMMIIIII I 

Db 921 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 980

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780

I Mllllll IIIIIMIIIIMI MM! IIMIIIIIIIIII Mllllll Ml 

Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1040

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

MMMMMMMMMIMMIM II IIIM llllllllllllllllll 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

Mllllll IIMIIIIIIIIIIIIIIIIIIIIIIIMIIII 
Db 1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1136 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960



Mlllllllllllllll M lllllllllll IIIIIIM MIIIIMIIIIII II 

Db 1137 TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1196 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II llllllllilllllllllll llllllllllllllllllllllllll llllllll 
Db 1197 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 1256 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080

Mill IIIIIIIIMIIIIIIIIM llllllllllllll lllllllllll mill 

Db 1257 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1316 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IIIIIIMIIIIIIIIIIIIIIIIIIIIIMlllllllllllllllllllllllllllll 

Db 1317 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1376 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIIMIIIIIIMIIIIIIIIMIIIIIIIII lllll II lllllllll 

Db 1377 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433 



RESULT 13 
AAQ03508 

ID AAQ03508 standard; DNA; 1560 BP. 
XX 

AC AAQ03508; 
XX 

DT 25-MAR-2003 (updated) 
DT 09-JAN-2003 (updated) 
DT 14-AUG-1990 (first entry) 
XX 

DE Simian Transforming growth factor - Beta1.
XX

KW HIV; AIDS; SIV; vaccine; AZT; CD4; cytokines; growth

KW factors; ds. 

XX 

OS Cebus apella. 
XX 

FH Key Location/Qualifiers
FT CDS 267..1437
FT /*tag= a
FT mat_peptide 1103..1437
FT /*tag= b

XX 

PN EP356935-A. 
XX 

PD 07-MAR-1990. 
XX 

PF 25-AUG-1989; 89EP-0115719.
XX
PR 25-AUG-1988; 88US-0236698.
XX 

PA (ONCO ) ONCOGEN LP. 
XX 

PI Brankovan V, Lioubin M, Purchio A; 

XX 

DR WPI; 1990-068723/10. 
DR P-PSDB; AAR05663.



XX 

PT Compsns. contg. transforming growth factor beta -

PT used for inhibitions of HIV infection and replication in vivo. 

XX 

PS Disclosure; Fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in vivo to prevent formation of syncytia and

CC inhibit HIV infection. TGF may also be used with other HIV treatments 

CC (AZT, soluble CD4 etc.). 

CC (Updated on 09-JAN-2003 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 79.6%; Score 953.4; DB 11; Length 1560; 

Best Local Similarity 88.3%; Pred. No. 1.6e-192; 

Matches 1057; Conservative 0; Mismatches 116; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II IIIIIIMMI lllllllllllll IIMIMIIIIIII M III 

Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 320 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

IIIIIMII IIMIIIIIIIII lllll IIIIIIIIMMM IMIMII Mill! 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 380 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

IIIIIIIIIIIIIIIIIIIM MM lllllllllll IIMIIIIIM Mill III 

Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 440

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

llllllllllllllllllll IIIIIIIIIIIIIIIIIMI lllll II II II II 

Db 441 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 500 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

IMIMII llllllllllllll IIIMIII III! II IMIIIIIMI II III 

Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II MIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMII I MM lllll 

Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 620 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

lllll II llllll I I I llllll llllllll I IIMIIII II llllll 
Db 621 TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 680 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II Mill II II IMIMII MIIIMI IIIMIMIIIIM llllllllllll 

Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 740 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

lllllllllll MIIIMI llllllll lllllllllllllllll IIIIIIMM 

Db 741 AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 800 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIIIIIIIIIIIIIIIM lllll MM lllllllll MM IIIIIIIM 

Db 801 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860 



Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIMIIIIIIIIIIIIIIIII III Mil 111 II II Mill llllllll II 
Db 861 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 920

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll III! IIIIIMIIIIIIIIIIIIII M Mill M IIIIIIIMI I 

Db 921 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 980

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I IIIIMM IIIIIMIIIIIII Mill llllllllllllll llllllll III 

Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1040 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIIIIIMIIIIIIIMIIIIIIIM II Mill llllllllllllllllll 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll IIMIIIIIIIIIIIIIIIIIIIMIIIIIIII 
Db 1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 1136 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIIIIMIIIIIIIII II lllllllllll llllllll llllllllllllll II 

Db 1137 TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1196 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II llllllllllllllllllll IIIIIIIIIMIIIIIIIIIIMIII llllllll 

Db 1197 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 1256 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080

IMM IIIMMIIIIIIIIIIIII IMIIIIIIIIIII IIIIIIIIIM llllll 

Db 1257 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1316 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

III llllll II III llllll MM IMIIIIIIIIIII Mill MM Mill I Mill II 

Db 1317 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1376 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

llllllllllllllllllllllllllllllllllllll Mill II lllllllll 

Db 1377 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433 



RESULT 14
AAV52933

ID AAV52933 standard; cDNA; 2537 BP.
XX

AC AAV52933;
XX

DT 25-MAR-2003 (updated)
DT 21-DEC-1998 (first entry)
XX

DE Human pre-transforming growth factor-beta 1 cDNA.
XX

KW Transforming growth factor-beta 1; TGF-beta 1; human; ss.
XX

OS Homo sapiens.
XX

FH Key Location/Qualifiers
FT CDS 842..2014
FT /*tag= a
FT mat_peptide 1676..2011
FT /*tag= b
FT stem_loop 37..113
FT /*tag= b
FT /note= "putative stable hairpin loop"
FT misc_feature 2015..2100
FT /*tag= c
FT /note= "GC-rich sequence"
FT polyA_signal 2514..2520
FT /*tag= d

XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998.
XX

PF 30-MAY-1995; 95US-0454468.
XX

PR 13-MAR-1987; 87US-0025423.
PR 22-MAR-1985; 85US-0715142.
PR 04-AUG-1989; 89US-0389929.
PR 04-MAR-1992; 92US-0845893.
PR 05-NOV-1993; 93US-0147364.
PR 30-MAY-1995; 95US-0454468.
XX

PA (GETH) GENENTECH INC.
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR P-PSDB; AAW78785. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig 1B 1-3; 26pp; English.
XX 

CC This nucleotide sequence codes for the human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1, see AAW78785). It is a

CC composite of overlapping cDNA clones isolated from different cDNA 

CC libraries (placenta, A172 glioblastoma, HT1080 fibroblastoma) using

CC TGF-beta exon (see AAV52936) restriction fragments as probes. 

CC The 3' region of the sequence was determined using cloned genomic 

CC DNA. The invention relates to the recombinant production of 

CC TGF-beta. Biologically active TGF-beta is defined as being capable 

CC of inducing EGF-potentiated anchorage independent growth of target 

CC cell lines and/or growth inhibition of neoplastic cell lines. 

CC Nucleic acids encoding TGF-beta have been isolated and cloned into 

CC vectors which are replicated in bacteria and expressed in 

CC eukaryotic cells. TGF-beta recovered from transformed cells is 

CC used in known therapeutic applications. TGF-beta nucleic acids are 

CC also useful in diagnosis and identification of TGF-beta clones. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 



SQ Sequence 2537 BP; 475 A; 895 C; 736 G; 431 T; 0 other; 



Query Match 79.5%; Score 951.8; DB 19; Length 2537; 

Best Local Similarity 88.2%; Pred. No. 3.6e-192;

Matches 1056; Conservative 0; Mismatches 117; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III III! II lllllllllll lllllllllllll llllllllllllll II III 
Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

IIMIIIIIIIII lllllll I Mill llllllllllllll lllllllllllll I 

Db 902 CTGACGCCTGGCCCGCCGGCCCCGGGACTATCCACCTGCAAGACTATCGACATGGAGCAG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

llllllllllllllllllllllllll lllllllllll lllllllllll Mill III 

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 



Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240
IIIIMIIIIIMIIIIIII MIIIIIIIIIIIIMIIII Mill II II II II
Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300
llllllll IMIIIIIIIIMI IIMIIII MM IIIIIIIIIMIIIIII Ml
Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360
II IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMMIIIIIII I MM Mill
Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420
Mill II IIIIM I I llllll llllllll I llllllll II llllll
Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480
M IIMIIII II Mill II IMMIII llllllllllllll llllllllllll
Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540
IIIIIIIIIIIIIIMIIIIIIIIIIIII lllllllllllllllll MIIMIIM
Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600
lllllllllllllllllllllll Mill Mill II llllll I II lllllllll
Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660
lllllllllllllllllllll III III Ml I II Mill IIIIMII II
Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGAGAAATTGAGGGCTTTCGCCTTAGC 1501

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720
lllllll MM llllllll lllllllllll II Mill II IIIIIMIII I
Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780
I llllllll IIIIIIIIIIIIMIIIIII IIMIIIIIIIIII IIIIMII III
Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621



Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840
IIIMIIIIIIIIIIIIIIIIIMIMII Mill IIMIIIMIIIIIIIII
Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900
llllllll llllllll llllllllllllllllllllllll
Db 1676 1717

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960
lllllllllllllllll llllllllllllll llllllll llllllllllllll II
Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
II llllllllllllllllllll llllllll llllllll IIIMIII llllllll
Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
Mill Mllllllllllllllllll llllllllllllll IMIIIMIII mill
Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
IIIIIIIIIMIIIIIIIIIIIIIIIIIIIII IIIIIIIIMIIIMMIIIIIIIIII
Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
IIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIM
Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014



RESULT 15 
ABV75391 

ID ABV75391 standard; DNA; 1176 BP. 

XX 

AC ABV75391; 
XX 

DT 18-MAR-2003 (first entry) 
XX 

DE TGFB1 Arg25Pro polymorphism G-allele nucleotide sequence.
XX 

KW Polymorphism; transforming growth factor beta 1; TGFB1; TGFbeta-1;
KW renal failure; nephrotropic; human; gene; ds.

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 1..1176 
FT /*tag= a 

XX 

PN WO200290585-A2. 
XX 

PD 14-NOV-2002. 
XX 

PF 08-MAY-2002; 2002WO-GB02066.
XX 

PR 09-MAY-2001; 2001GB-0011277.
XX 



PA (UYSH-) UNIV SHEFFIELD HALLAM.
XX 

PI El-Nahas AM, Blakemore A, Khalil MS; 
XX 

DR WPI; 2003-120560/11. 

DR P-PSDB; ABB82780.

XX 

PT Determining an individual's susceptibility to the progression of renal 

PT failure comprises detecting the presence of a genetic polymorphism 

PT pattern in transforming growth factor beta 1 (TGFB1) gene in a sample

PT from the individual 
XX 

PS Claim 49; Page 58; 62pp; English. 
XX 

CC The invention relates to determining an individual's susceptibility to
CC renal failure and involves detecting the presence of a genetic
CC polymorphism pattern in transforming growth factor beta 1 (TGFB1) gene in
CC a sample from an individual, where polymorphism pattern is associated
CC with renal failure. The method is useful for determining an individual's
CC susceptibility to the progression of renal failure. The nucleic acid
CC comprising a T(-509)C polymorphism of TGFB1 gene, or a polypeptide
CC comprising a sequence of 391 amino acids is useful for preparing a
CC medicament for retarding or preventing the progression of renal disease,
CC and for drug research purposes for retarding or preventing the
CC progression of renal disease. Sequences ABV75386-88 represent the
CC nucleotide sequence for the TGFB1 G-allele of the Arg25Pro polymorphism
CC of exon 1.

XX 

SQ Sequence 1176 BP; 239 A; 381 C; 354 G; 202 T; 0 other; 

Query Match 79.0%; Score 945.2; DB 25; Length 1176; 

Best Local Similarity 88.3%; Pred. No. 8.1e-191; 

Matches 1060; Conservative 0; Mismatches 113; Indels 27; Gaps 2; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60

IN MM M MMMMMI MMMMMMI MMMMMMM M Ml 

Db 1 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 60 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

MMMMMMMMMMMI MMI MMMMMMM MMMMMMMI 

Db 61 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 120

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180

MMMMMMMMMMMMM MMMMMI IMIIIIIIII Mill Ml 

Db 121 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 180

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IIIIIMIIIIIIIIIIMI IIIIIIIIIIIIIIIIIIM Mill II II M II 

Db 181 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 240

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

MMIMI IMIIIIIIIIIII llllllll MM IIIIIIIIIIIIIIMI III 

Db 241 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 300

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIM I MM Mill 

Db 301 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 



Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

lllll M iiiiii I I Miiii llllllll I llllllll II mill 

Db 361 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACA 420

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT---GAGG 477 

II llllllll II lllll II IIIIIMI IIIIIIMIIIIII lllll MM 

Db 421 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 480

Qy 478 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 537 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIII IIIIIIIIIIIIIMII Mllllll 

Db 481 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 540

Qy 538 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 597 

II IIIIIIIIIIIMMIIIIIIII lllll Mill II IIMII I II MUM 

Db 541 CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 600 

Qy 598 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 657 

llllllllllllllllllllllll III III III II II lllll Mllllll 
Db 601 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 660 

Qy 658 AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 717 

II lllllll MM llllllll Mlllllllll M lllll II IIIMIIM 

Db 661 AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 720

Qy 718 AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 777 

I I I IIIMIII llllllllllllllllllll llllllllllllll llllllll 

Db 721 ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 780 

Qy 778 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGAC 837 

IIIIIIIMIIIIMIIIIIIMIIIMIIII lllll MIIIIIIIIIIIIIMI 

Db 781 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA--- 837 

Qy 838 TACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAG 897

Mllllll llllllll IIIIIIIIIIIIIMIIIIII 
Db 838 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAG 876

Qy 898 AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 957

llllllllllllllllllll llllllllllllll llllllll llllllllllllll 
Db 877 AACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATC 936 

Qy 958 CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1017 

II II IIMIIIIIIIIIIIIIIII llllllll llllllll llllllll IIMII 

Db 937 CACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC 996 

Qy 1018 CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1077

II lllll IMIIMIIIIIIIIIIIII llllllllllllll Mlllllllll III 

Db 997 CTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCG 1056 

Qy 1078 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137 

IMIIIIIIIIIIIIIIIIIIIMIIIIIIMIII IIIIIIIIMIIIIIIIIIMIII 

Db 1057 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC 1116 

Qy 1138 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IMIIIMIIIIIIIIIIIIIIIMMIIIIIIIIIIIIII IIIIIMIIIIIIIIIII 

Db 1117 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1176 
ALIGNMENTS



RESULT 1 
US-10-087-268-1 

; Sequence 1, Application US/10087268 
; Publication No. US20030119010A1 



GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER: US/10/087,268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS: 6
SOFTWARE: PatentIn version 3.1
SEQ ID NO 1
LENGTH: 1821
TYPE: DNA
ORGANISM: Human
FEATURE:
NAME/KEY: 5'UTR
LOCATION: (1)..(511)
OTHER INFORMATION:
NAME/KEY: CDS
LOCATION: (512)..(1684)
OTHER INFORMATION:
NAME/KEY: sig_peptide
LOCATION: (512)..(598)
OTHER INFORMATION:
NAME/KEY: 3'UTR
LOCATION: (1685)..(1821)
OTHER INFORMATION: 
US-10-087-268-1 

Query Match 80.1%; Score 958.2; DB 14; Length 1821; 

Best Local Similarity 88.6%; Pred. No. 1.2e-258; 

Matches 1060; Conservative 0; Mismatches 113; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II IIMMMMI IMIIIIIIIIII IIMIIIIIIIMI M III 

Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

MIMMIMIMIIIIIIMII IMM IIMMIIMIIM MIIIIIIMMMI 

Db 572 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

MIMIIMIIIMIIMIIIMIM IIMIIIIIII IIMMMMI Mill III 

Db 632 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 691 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IIMMIMMIMMMM MMIMIIMMMIMII MM! II M II II 

Db 692 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300

IMIIMI MIIIMIIIMM IMIMM MM IIIMMMIIIMIII III 

Db 752 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360

II IMIIIIIIMIMIMMIIIIIMIIIIIIIIIMIIMM I MM Mill 

Db 812 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 871 



Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420

Mill II Mill! I I MINI llllllll I llllllll II Mill! 

Db 872 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 931

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

II llllllll II Mill II llllllll llllllllllllll llllllllllll 
Db 932 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 991 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

IMIIIIIIIIIIIIIIIIIIIMIIIII IIMIIIIIIIIIIIII llllllllll 

Db 992 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1051

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIIIIIIIMIIIIIIII Mill Mill II llllll I II IIIIIIMI 

Db 1052 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1111 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIIIMIIIIIIII III III III II II Mill llllllll II 
Db 1112 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1171 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll MM llllllll IIIIIIIMII II Mill II llllllllll I 

Db 1172 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1231 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I IIMIIII llllllllllllllllllll IIIIMIIIIIIII llllllll III 

Db 1232 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1291

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIIIMIIIIIIIIIIIIIIIIIIillll Mill IIIIIIIIIIIIIMMI 

Db 1292 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1345 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll Mllllll IIIIIIIIIIIIIMIIIIIIIII 

Db 1346 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1387

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IMIIIIMIIIIIIII llllllllllllll IIMIIII IIIIMIIIIIIII II 

Db 1388 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1447

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II IIIIMIMIIIIIIIIIII Mllllll IIIIMII llllllll llllllll 

Db 1448 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1507

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIIIIIIIIIIIIIIIIMI IIIIIMIIIIIII lllllllllll llllll 

Db 1508 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1567 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMII lllllllllllllllllllllllllll 

Db 1568 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1627 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIMIIIIIIIIIIIIIIIMIMIIIMIIIIIII llllllllllllllllll 

Db 1628 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684 



RESULT 2 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod 
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT: Xue, Aidong J. 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou, Ping 
APPLICANT: Ma, Yunqing 
APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: Tillinghast, John 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: Novel Nucleic Acids and
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/10/037,270
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS: 1104
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 220 
LENGTH: 2742 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (842)..(2014)
US-10-037-270-220 

Query Match 80.1%; Score 958.2; DB 14; Length 2742; 

Best Local Similarity 88.6%; Pred. No. 1.3e-258; 

Matches 1060; Conservative 0; Mismatches 113; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M IIIIIIIIIM IIMIIIIMIII IIMIMMIIMI II Ml 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

IMMIIMIMIIIIIMIIM lllll llllllllllllll IIIIIIIIIIIMII 

Db 902 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

MMMMMMMMMMMMM IIMIIMIII MIMMIIII lllll III 

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021



Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

llllllllllllllllllll MIMIMIIIIIIIIIIII Mill II II II II 

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

IIIIIMI MMIMIIIMII IMIMM MM MMMMMIMMM Ml 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIIIIMMIIIIMIIIIMIIIMIIIIIIIIIIIIIIIII I MM Mill 

Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II lIMM I I IIIIII IIIIIIII I IIIIIIII M MUM 

Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II MMIIII M Mill II MMMM MIIIMIIIMII IIIIIMIMM 

Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

MMMMMMMMMMMIMMM IIMMIIIIIIIIIII IIIIIIMII 

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1381 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIIMIIIIIIIIIIIII Mill Mill II IIIIII I II lllllllll 

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660

lllllllllllllllllllll III III Ml II II Mill IIIIIIII II 

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll MM IIIIIIII lllllllllll II Mill II llllllllll I 

Db 1502 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATC^ 1561 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I IIIIIIII IMIIIIMIIIIIIIIIII llllllllllllll IIIIIIII III 

Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IMIIIMIIIIIIIIIIIIIIIIMIII Mill IIIIIMIIIIIIIIIII 

Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1675 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

MMIIII IIIIIIII IIIIIMIIMIIIIIIIIIIIII 

Db 1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

MMMMMIMMM IIIIIIMIIIIII IIIIIIII llllllllllllll II 

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II MMMMIIMMMMM MMMM IIIIIIII IMIMM IIIIMII 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837



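Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
Db 1838

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
IIIIMMIIIIIMIIIMIIIIMIIIIII MIIIIIMIIIIIIIIIIIIIII
Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
IIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIMIIII
Db 1958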



RESULT 3 

US-10-087-268-4 

; Sequence 4, Application US/10087268 
; Publication No. US20030119010A1 
; GENERAL INFORMATION: 

APPLICANT: Jonsonn, Julie Ruth 
; APPLICANT: Powell, Elizabeth Ellen 

; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 
; CURRENT APPLICATION NUMBER: US/10/087,268
; CURRENT FILING DATE: 2002-03-01
; NUMBER OF SEQ ID NOS: 6
SOFTWARE: PatentIn version 3.1
; SEQ ID NO 4
LENGTH: 1821
TYPE: DNA
ORGANISM: Human
FEATURE:
NAME/KEY: 5'UTR
LOCATION: (1)..(511)
OTHER INFORMATION:
NAME/KEY: CDS
LOCATION: (512)..(1684)
OTHER INFORMATION:
NAME/KEY: sig_peptide
LOCATION: (512)..(598)
OTHER INFORMATION:
NAME/KEY: 3'UTR
LOCATION: (1685)..(1821)
OTHER INFORMATION:
US-10-087-268-4 

Query Match 79.9%; Score 956.6; DB 14; Length 1821; 

Best Local Similarity 88.5%; Pred. No. 3.3e-258; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II IIMIIIIMI IMIIIIIIIIII IIIIMIIMIIII II III 

Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

IIMIMIIIIII IIIIIIIM IIMI IIIIIMIMIIII IIIIIIIMIIIIII 

Db 572 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 



Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180

IMIIIIIIIMIIIIIIIMIIIM IMIMIIIII IIIMIIIIII Mill III 

Db 632 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 691 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IIIMIIIIMMMIMM MMIMIMIIIIIMIII Mill M M M II 

Db 692 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

IIMIIM IIIMMIIIIIII IIIMIM MM lllllllllllllllll III 

Db 752 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II MIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIMII I MM Mill 

Db 812 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 871 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

IMM II MUM I I MMII MIIIMI I MIIMM M MUM 

Db 872 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACAT 931 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

II IIIMIM II Mill II MIIIMI llllllllllllll llllllllllll 

Db 932 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 991 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

IIIIIIIIIIIIIIIIIIIIIIIMIMI IIIIMIIIIIIIIIII MIIIMIII 

Db 992 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1051 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIIIIIIIIIIIMIIIIMII Mill Mill II MMII I M IMIIIIII 

Db 1052 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1111 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IIIIIIIIIIIIIIIMIIM III III III II II Mill IIIMIM M 

Db 1112 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1171 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720

lllllll MM IIIMIM lllllllllll II Mill II IIMIIIIII I 

Db 1172 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1231 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I llllllll IIIIIIIMIIIIIIIIIII llllllllllllll IIIMIM III 
Db 1232 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1291 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840

IIIIIIIIIIMIMIIMIIIIIIIIII IMM IIIIMIIIIIIIIIIII 

Db 1292 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 1345

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

llllllll llllllll IIIIIIIIIIIIIIIIMIIIIII 
Db 1346 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 1387 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIIIIIIIIIIIIMI IIMIIIIMIIII llllllll IIIIIIIIIIIIM II 

Db 1388 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1447



Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020
II IIIIIIIIMIIIIIIMII llllllll Mllllll llllllll lllllll
Db 1448 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080
Mill IIIIIIIIIIIMMIIIII IIMIIIIMMII IIMIMMII MM
Db 1508 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGG

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140
MMMMMMMMMUMMMMMM M M M M M M M M M M M M I
Db 1568 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCC

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197
MMMMMMMMMIMMMMMMMMMI MMMMMMMMM
Db 1628



RESULT 4 

US-09-948-002-28 

; Sequence 28, Application US/09948002 

; Publication No. US20030050265A1 

; GENERAL INFORMATION: 

; APPLICANT: Nicholas M. Dean 

; APPLICANT: Susan F. Murray 

; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER: US/09/948,002 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,546 

; PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS : 71 

; SEQ ID NO 28 

LENGTH: 2745 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (842)...(2017) 
US-09-948-002-28 

Query Match 78.8%; Score 943.6; DB 11; Length 2745; 

Best Local Similarity 88.2%; Pred. No. 1.7e-254; 

Matches 1059; Conservative 0; Mismatches 114; Indels 27; Gaps 2; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Mi MilMIMMMMM M M M M M M I M M M I M M M I M M I 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

MMMMMMI MMMMI MMI MMMMMMM MMMMMMMI 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 



Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

MMMMMMMMMMMMM MMMMMI MMMMMI MMI Ml 



Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

MMIIMIIIMMIMII IIIMIIIIIIIMIIMM IIIM M II M II 

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

llllllll MIIIIIIIIIIM llllllll MM IMMIMMMMMI Ml 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

M MMMMMMMMMMMMMMMMMMMMMI I MM MMI 

Db 1142 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

MMI II MUM I I IIIIM MIIIIM I llllllll II llllll 

Db 1202 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 1261 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGG 477 

M IMMMI M MMI M IMMMI MIMMMIIMI Mill MM 

Db 1262 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 1321 

Qy 478 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 537 

MIMMMIIMIIIIIIIIMIMIIIMI IIMIIMMIIIMM MIIIIM 

Db 1322 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 1381 

Qy 538 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 597 

M MMMMIMMMMMMM MMI MMI II llllll I II IIIIM 

Db 1382 CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 1441 

Qy 598 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 657 

llllllllllllllllllllllli Ml III III M II Mill llllllll 

Db 1442 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 1501 

Qy 658 AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 717 

II IIIIIM MM IMMMI lllllllllll II Mill II lllllllll 

Db 1502 AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 1561 

Qy 718 AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 777 

I I I llllllll IIIIIMIIIIIIMIIIII llllllllllllll IMMMI 

Db 1562 ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 1621 

Qy 778 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGAC 837 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIII Mill IIIIIIIIIIIIIIIMI 
Db 1622 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA--- 1678 

Qy 838 TACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAG 897 

MIIIMI llllllll IIIIIIIIIIIIIMIMIII 

Db 1679 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAG 1717 

Qy 898 AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 957 

Mllllllllllllllllll llllllllllllll llllllll IIIIIIIIIIMII 
Db 1718 AACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATC 1777 

Qy 958 CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1017 

II M IIIIIIIIIIIMIIMIII MIIIIM MIIIIM MIIIIM llllll 

Db 1778 CACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC 1837 



Qy 1018 CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1077 

II lllll llllllllllllllllllll llllllllllllll lllllllllll III 
Db 1838 CTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC^^ 1897 



Qy 1078 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137 

IIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIII llllllllllllllllllllllll 

Db 1898 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC 1957 

Qy 1138 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIMIINIMIIIIIIIIIIIMIIIIIIIIIIII IIIIIIMIIIIIIMM 

Db 1958 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2017 



RESULT 5 

US-09-948-002-27 

; Sequence 27, Application US/09948002 

; Publication No. US20030050265A1 
; GENERAL INFORMATION: 

; APPLICANT: Nicholas M. Dean 
; APPLICANT: Susan F. Murray 
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER: US/09/948,002 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 
; PRIOR FILING DATE: 2000-09-14 
; PRIOR APPLICATION NUMBER: 60/154,546 
; PRIOR FILING DATE: 1999-09-17 
; NUMBER OF SEQ ID NOS : 71 
; SEQ ID NO 27 

LENGTH: 1585 

TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (413)...(1585) 
US-09-948-002-27 

Query Match 71.4%; Score 854.2; DB 11; Length 1585; 

Best Local Similarity 83.1%; Pred. No. 1.7e-229; 

Matches 995; Conservative 0; Mismatches 178; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM llllllllllllll lllllll lllll II II I Illlll Mllll 

Db 413 ATGCCGCCCTCGGGGCTGCGGCTGCTGCCGCTTCTGCTCCCACTCCCGTGGCTTCTAGTG 472 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

MMMM M MM MMI MMI II M M M M M I M M M II M M M I M 

Db 473 CTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGGAGCTG 532 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

lllll llllllllllllll Mill M IIIMIII MMMM II MM III 

Db 533 GTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGCTCGCC 592 



Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 



II lllllllllllllllll II IIIII IIIIIIIIIII IIIII II II III I 

Db 593 AGTCCCCCGAGCCAGGGGGAGGTACCGCCGGGCCCGCTGCCCGAGGCGGTGCTCGCTTTG 652 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

Illlllll IIIIIIIIIIIIM II II II II I II llllllllllllll III 

Db 653 TACAACAGCACCCGCGACCGGGTGGCAGGCGAGAGCGCTGACCCGGAGCCCGAGCCCGAG 712 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

Mlllllllllllllll II lllllllllllllllllllllll II MM MM 

Db 713 GCGGACTACTACGCCAAAGAAGTCACCCGCGTGCTAATGGTGGACCGCAACAACGCAATC 772 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

IIIII III III IIIII IIIII Illlllll I IIIII IIIII II I 
Db 773 TATGACAAAACCAAAGACATCACACACAGTATATATATGTTCTTCAATACGTCA^ 832 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

Illlllll IIIII Mill llllll II II IIIIIIIIMIIIIII III II 
Db 833 CGGGAAGCAGTGCCAGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCCTGCAGAGATTC 892 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

MM M MMMM Mill M M IMIIMMM IMM IIMMMM 

Db 893 AAGTCAACTGTGGAGCAACACGTAGAACTCTACCAGAAATATAGCAACAATTCCTGGCGT 952 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IIIII I llllllllllll llllll III I M IIIIIIIIIII IIIII III 
Db 953 TACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTGACGTC 1012 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

II Illlllll Mlllllllllll II MM I III MM llllll Mill 

Db 1013 ACTGGAGTTGTCCGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGCTTCAGT 1072 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

II MM II I MIIMIMIIIIM MMIIIMIIIIII M Ml III I 

Db 1073 GCTCACTGCTCTTGTGACAGCAAAGATAATGTACTCCACGTGGAAATCAATGGGATCAGT 1132 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I M II IIIIIIIIII llllll M I lllllllll lllllllllllllll 

Db 1133 CCCAAACGTCGAGGTGACCTGGGCACCATCCATGACATGAACCGACCCTTCCTGCTCCTC 1192 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

IIIIIIIIIII Mill IIIII II lllllllllllllll lllllll II 

Db 1193 ATGGCCACCCCCCTGGAAAGGGCTCAACACCTGCACAGCTCCAGGCACCGGAGA 1246 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

IIIIMIMIIMIIIIIIIIIIIIIIIIIM lllllllll 

Db 1247 GCCCTGGATACCAACTACTGCTTCAGCTCCACAGAGAAGAAC 1288 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

IIIII II Illlllll MMMM III II I II II llllll M lllllllll II 

Db 1289 TGCTGTGTACGGCAGCTGTACATTGACTTTAGGAAGGACCTGGGTTGGAAGTGGATCCAC 1348 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

M IIIIIIIIMIIIMIllll IIIII Mlllllllll MMMM Illlllll 

Db 1349 GAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGGCCCTGCCCCTACATTTGGAGCCTG 1408 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

IIIII lllllllllllllllll II II Illlllll IIIIIIMIII II II II 



Db 1409 GACACACAGTACAGCAAGGTCCTTGCCCTCTACAACCAACACAACCCGGGTGCTTCCGCA 1468 



Qy 1081 

Db 1469 

Qy 1141 

Db 1529 



llllllllllllllllll lllllllllllllllllll llllllllllllllllll 



RESULT 6 
US-09-948-002-1 

; Sequence 1, Application US/09948002 
; Publication No. US20030050265A1 
; GENERAL INFORMATION: 
; APPLICANT: Nicholas M. Dean 

; APPLICANT: Susan F. Murray 

; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER: US/09/948,002 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,546 

; PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS: 71 

; SEQ ID NO 1 

LENGTH: 2094 

TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (868)...(2040) 
US-09-948-002-1 

Query Match 70.8%; Score 847.8; DB 11; Length 2094; 

Best Local Similarity 82.8%; Pred. No. 1.2e-227; 

Matches 991; Conservative 0; Mismatches 182; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM llllllllllllll lllllll Illll II II I llllll llllll 
Db 868 ATGCCGCCCTCGGGGCTGCGGCTACTGCCGCTTCTGCTCCCACTCCCGTGGCTTCTAGTG 927 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

llllllll II III! Illll Illll llllllllllllllllllllllllllllll 

Db 928 CTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGGAGCTG 987 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

Mill IIIIIIIIIMIII Mill M llllllll IMIIIII II MM III 

Db 988 GTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGCTCGCC 1047 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

II Mill IIIIIMIIII II lllllllllllllllll Illll II II Ml I 

Db 1048 AGTCCCCCAAGCCAGGGGGAGGTACCGCCCGGCCCGCTGCCCGAGGCGGTGCTCGCTTTG 1107 



Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

IMMIII IIIIIIIIIIMII II II II II I III II Mill nil! II 

Db 1108 TACAACAGCACCCGCGACCGGGTGGCAGGCGAGAGCGCCGACCCAGAGCCGGAGCCCGAA 1167 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

lllllllllll II II IIIIIIIIIIIIIIIMIIIIIIIM II MM III 

Db 1168 GCGGACTACTATGCTAAAGAGGTCACCCGCGTGCTAATGGTGGACCGCAACAACGCCATC 1227 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

MM! Ml Ml Mill Mill IIMMM I Mill Mill II I 

Db 1228 TATGAGAAAACCAAAGACATCTCACACAGTATATATATGTTCTTCAATACGTCAGACATT 1287 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

Mllllll Mill Mill llllll II II IMIIIIIIIII III III I 

Db 1288 CGGGAAGCAGTGCCCGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCTTGCAGAGATTA 1347 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540 

II I II Mllllll II Mill II lllllllllll Mill llllllllll 

Db 1348 AAATCAAGTGTGGAGCAACATGTGGAACTCTACCAGAAATATAGCAACAATTCCTGGCGT 1407 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

IMM I IMMIMMM IIIMI III I II IMMMIMI Mill Ml 

Db 1408 TACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTGACGTC 1467 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

II IIMMM IMllllllllll II MM I Ml MM Mill MM 

Db 1468 ACTGGAGTTGTACGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGATTCAGC 1527 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

II MM M I IIIIMIIMMIIIII IIIMMIIMMM llllll III 

Db 1528 GCTCACTGCTCTTGTGACTVGCAAAGATAACAAACTCCACGTGGAAATO^ 1587 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

I II Mill IIIIIM llllll II I lllllllllllllllMlllllllI 

Db 1588 CCCAAACGTCGGGGCGACCTGGGCACCATCCATGACATGAACCGGCCCTTCCTGCTCCTC 1647 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTAC 840 

IIMIIIMII Mill llMIMMllllMllllllll Mllllll II 

Db 1648 ATGGCCACCCCCCTGGAAAGGGCCCAGCACCTGCACAGCTCACGGCACCGGAGA 1701 

Qy 841 AAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 900 

lllllllllllllllll lllllllllMlll lllllllll 

Db 1702 GCCCTGGATACCAACTATTGCTTCAGCTCCACAGAGAAGAAC 1743 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

Mill IMIIIIIIM lllllllllll MMMIMMM lllllllllll II 

Db 1744 TGCTGTGTGCGGCAGCTGTACATTGACTTTAGGAAGGACCTGGGTTGGAAGTGGATCCAC 1803 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II IMIMMMIIMIIMM Mill Mill Mill Mill II IIMMM 

Db 1804 GAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGACCCTGCCCCTATATTTGGAGCCTG 1863 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill IIMIIIIIMMIIII II II lllllMl llMIIMMIIll Mllll 

Db 1864 GACACACTVGTACAGCAAGGTCCTTGCCCTCTACAACCTWVCACAACCCGGGCGCTTCGG 1923 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 



Db 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

Db 




RESULT 7 

US-09-756-283A-19 

; Sequence 19, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Chernajovsky, Yuti 

; APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER: US/09/756,283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 

; SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 19 

LENGTH: 1376 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: LAP-mIFNbeta construct 
NAME/KEY: CDS 
LOCATION: (1)..(1368) 
US-09-756-283A-19 

Query Match 54.6%; Score 654; DB 10; Length 1376; 

Best Local Similarity 88.1%; Pred. No. 2.4e-173; 

Matches 724; Conservative 0; Mismatches 95; Indels 3; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 



Db 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

Db 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

Db 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240 

Db 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

Db 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

M IIIMIIIIMIIIIIIIMIIIIIIIMIIIIIIIIIIIIII I MM MMI 

Db 301 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II Mllll I I llllll llilllll I llllllil II IIMII 

Db 361 TATGACAAGTTCT^GCAGAGTACACACAGCATATATATGTTCTTCAACACATC^ 420 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGG 477 

M MMMM M MMI M MMMM MMMMMMM MMI MM 

Db 421 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 480 

Qy 478 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 537 

MMMMIMMMMMMIMMMMM IMIIIMMIIIIMI IMMIM 

Db 481 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 540 

Qy 538 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 597 

II IIIIMIIIIIIIIIIIIIIIII Mill Mill II Mllll I II llllll 

Db 541 CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 600 

Qy 598 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 657 

IMIIIMIIIIIIIIIMIIIM III Ml Ml M II Mill IIIIIMI 

Db 601 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 660 

Qy 658 AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 717 

II lllllll MM IMMIM IIIIMIMII II Mill II IIIIIIIM 

Db 661 AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 720 

Qy 718 

IMMIM MIMIIIIMIIMIMII IIIIIIIMIIIM MMMM 

Db 721 

Qy 778 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGC 819 

IIIIMMIMIMIIIMMMMIMMM MMI Ml 

Db 781 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGC 822 



RESULT 8 

US-09-756-283A-21 

; Sequence 21, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Chernajovsky, Yuti 

; APPLICANT: Dreja, Hanna Stina 

; APPLICANT: Adams, Gillian 

; TITLE OF INVENTION: Latent Fusion Protein 

; FILE REFERENCE: 0623.1000000 
; CURRENT APPLICATION NUMBER: US/09/756,283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 
; SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 21 

LENGTH: 1352 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: mIFNbeta-LAP construct 
NAME/KEY: CDS 
LOCATION: (1)..(1344) 
US-09-756-283A-21 



Query Match 49.4%; Score 591.4; DB 10; Length 1352; 

Best Local Similarity 87.0%; Pred. No. 8.6e-156; 

Matches 662; Conservative 0; Mismatches 96; Indels 3; Gaps 1; 

Qy 63 GACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGT 122 

III M I III MM III llllllllllllll lllllllllllllllll 

Db 582 GGCGGGAGGGGGCTCAGCGGCCGCACTATCCACCTGCAAGACTATCGACATGGAGCTGGT 641 

Qy 123 GAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAG 182 

IIIIIIIIIIIIIIIMIIIMII IIIIMIIIII IIMIIIIIII Mill Mill 

Db 642 GAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAG 701 

Qy 183 CCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTA 242 

IIIMIIIIIIIIIIIII llllllllllllllllllll Mill M II II II II 

Db 702 CCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTA 761 

Qy 243 CAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGC 302 

IIIMI llllllllllllll llllllll MM IIIIIIIIIMIIIIII Mill 

Db 762 CAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGC 821 

Qy 303 GGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTA 362 

lllllllllllllllllllllllllllllllllllllllllll I MM lllllll 

Db 822 CGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTA 881 

Qy 363 TGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCG 422 

III M IIIMI I I IIIMI llllllll I IIIIMII II llllllll 

Db 882 TGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTCCG 941 

Qy 423 GGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGGCT 479 

llllllll II Mill II IIIIMII llllllllllllll Mill Mill! 

Db 942 AGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGGCT 1001 

Qy 480 CAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCG 539 

MIIIIIMIIIIIIIIIIIMIIIIIIII lllllllllllllllll llllllllll 

Db 1002 CAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCG 1061 

Qy 540 CTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 599 

IIIIIIIIIIIIIIMIIIIIII Mill Mill II IIIMI I II llllllll 

Db 1062 ATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGT 1121 

Qy 600 CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 659 

llllllllllllllllllllll III III III II M Mill llllllll II 

Db 1122 CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 1181 

Qy 660 TGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAA 719 

lllllll MM llllllll IIIIMIIIII II Mill II llllllllll 

Db 1182 CGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCAC 1241 

Qy 720 TTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCT 779 

I I llllllll llllllllllllllllllll llllllllllllll llllllll II 

Db 1242 TACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCT 1301 

Qy 780 CATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 820 

llllllllllllllllllllllllllllll nil! MM 

Db 1302 CATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCT 1342 



RESULT 9 

US-09-911-904-167 

Sequence 167, Application US/09911904 
Publication No. US20030096234A1 
GENERAL INFORMATION: 
APPLICANT: Farr, Spencer B. 
APPLICANT: Pickett, Gavin G. 
APPLICANT: Neft, Robin Eileen 
APPLICANT: Dunn, II, Robert Thomas 
TITLE OF INVENTION: CANINE TOXICITY GENES 
FILE REFERENCE: 400742000200 
CURRENT APPLICATION NUMBER: US/09/911,904 
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: US 60/220,057 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 386 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 167 
LENGTH: 489 
TYPE: DNA 

ORGANISM: Canis familiaris 
FEATURE: 

NAME/KEY: misc_feature 
LOCATION: (1)...(489) 

OTHER INFORMATION: n = A, T, C or G 
US-09-911-904-167 

Query Match 28.7%; Score 344; DB 11; Length 489; 

Best Local Similarity 89.9%; Pred. No. 1.6e-86; 

Matches 390; Conservative 0; Mismatches 20; Indels 24; Gaps 1; 

Qy 764 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 823 

I IIIIIMIIIIIIIIIIIIIIIIIII lllllllllllllllllllllllllllllll 
Db 1 GACCCTTCCTGCTCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCC 60 

Qy 824 GGCACCGCCGAGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCA 883 

MM MMI MMMMI MMMMMMMM 

Db 61 GGCAGCGCCG GGCCCTGGACACCAACTACTGCTTCA 96 

Qy 884 GCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGG 943 

MMMMMMMMMMMMI MMMMMMMMMMMI MMI MM 

Db 97 GCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGG 156 

Qy 944 GCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTC 1003 

MMMMMIM MMI IIIMIII Mill II II IIIIMIIIMIIIIII I 

Db 157 GCTGGAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCC 216 

Qy 1004 CCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACA 1063 

lllllll llllllll Mill llllllllllllllllllll MMMMMMMM 

Db 217 CCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCACA 276 

Qy 1064 ACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCG 1123 

MMMMMIIMIIMMMIIMIIIIMMIIIMMMIMMIIIIMIIMM 

Db 277 ACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCG 336 



Qy 1124 TGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCT 1183 

IIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIMIIIII IIIIIIIIIIIMI MM 

Db 337 TGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAAGVTGATCGTGCGCTCCT 396 

Qy 1184 GCAAGTGCAGCTGA 1197 

MMMMMMM 

Db 397 GCAAGTGCAGCTGA 410 





RESULT 10 
US-09-813-271B-1 

; Sequence 1, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834A1artis Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 
; STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/813,271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. 

REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4-20039C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 339 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.hTGF-beta1 (DSM 5656) 



FEATURE: 

NAME/KEY: CDS 
LOCATION: 1..336 

OTHER INFORMATION: /product= "human TGF-beta1" 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-813-271B-1 

Query Match 25.8%; Score 308.6; DB 10; Length 339; 

Best Local Similarity 94.4%; Pred. No. 1.3e-76; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy 859 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 918 

llllllll llllllll IIIIIMMIIIIIIIIIIIIIIIMIIIIIIIIMIIIII 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60 

Qy 919 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 978 

llllllllllllll llllllll llllllllllllll II II lllllllllllllll 
Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120 

Qy 979 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1038 

III!! llllllll llllllll llllllll llllllll Mill llllllllllll 

Db 121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180 

Qy 1039 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1098 

llllllll llllllllllllll lllllllllll llllllllllllllllllllllll 
Db 181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240 



Qy 1099 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1158 

llllllllllllll IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300 

Qy 1159 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

llllllllllllllllllll llllllllllllllllll 
Db 301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339 



RESULT 11 
US-09-813-271B-7 

; Sequence 7, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 
; biologically active protein 

NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834A1artis Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 
; STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 



SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/813,271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. 

REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4-20039C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 336 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "recombinant hybrid DNA of 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.TGF-beta1(44/45)beta3 
FEATURE: 

NAME/KEY: mat_peptide 

LOCATION: 1..132 

OTHER INFORMATION: /product= "N-terminal 44 amino 
acids of human TGF-beta1" 
FEATURE: 

NAME/KEY: mat_peptide 
LOCATION: 133..336 

OTHER INFORMATION: /product= "C-terminal 68 amino 
acids of human TGF-beta3" 
FEATURE: 

NAME/KEY: CDS 
LOCATION: 1..336 

OTHER INFORMATION: /product= "hybrid TGF-beta named 
TGF-beta1-3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-813-271B-7 

Query Match 19.0%; Score 227.2; DB 10; Length 336; 

Best Local Similarity 79.8%; Pred. No. 8.6e-54; 

Matches 268; Conservative 0; Mismatches 68; Indels 0; Gaps 0; 
Qy 859 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 918 



Db 

Qy 919 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 978 

Db 

Qy 979 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1038 

mill I 

Db 121 GCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACG 180 

Qy 1039 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1098 

II 1 1 1 1 lllllllll 1 lllll 1 II II II 1 1 1 1 lllllllllll 

Db 181 GTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCC 240 

Qy 1099 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1158 

MM MMMM Ml MMI MMMI MM II IIIM lllllllll 

Db 241 CAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAG 300 

Qy 1159 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGC 1194 

II lllllllll 1 III M II II II III 

Db 301 CTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGC 336 





RESULT 12 

US-09-957-458B-9 

Sequence 9, Application US/09957458B 
Publication No. US20030166271A1 
GENERAL INFORMATION: 
APPLICANT: Chen, Una 

TITLE OF INVENTION: Method for growing stem cells 
FILE REFERENCE: P66567US0 

CURRENT APPLICATION NUMBER: US/09/957,458B 
CURRENT FILING DATE: 2001-09-21 
PRIOR APPLICATION NUMBER: PCT/EP00/08247 
PRIOR FILING DATE: 2000-08-24 
PRIOR APPLICATION NUMBER: EP 99116533 
PRIOR FILING DATE: 1999-08-24 
NUMBER OF SEQ ID NOS : 10 
SOFTWARE: PatentIn version 3.2 
SEQ ID NO 9 
LENGTH: 4382 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Vector for transforming supporting cell with a foreign to 
express 

OTHER INFORMATION: a gene product of interest 
US-09-957-458B-9 

Query Match 18.8%; Score 225.6; DB 12; Length 4382; 

Best Local Similarity 52.2%; Pred. No. 4.9e-53; 

Matches 652; Conservative 0; Mismatches 524; Indels 72; Gaps 4; 

Qy 16 CTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGG 75 

MM II I III III I I III I III MUM I I 

Db 455 CTGCAGCCCATGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACA 514 

Qy 76 CCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGC 135 

I I MMMM MM II I III I I I I III MM I 

Db 515 ATCAGCCTCTCTCTGTCCACTTGCACCACGTTGGACTTCGGCCACATCAAGAAGAAGAGG 574 

Qy 136 ATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAG 195 

I II IIIIM I II lllll II IIIIM MM IIIIIIIMI I 

Db 575 GTGGAAGCCATTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCA 634 



Qy 196 GGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGC 255 

II Ml I M III I II Mill IIIIIIIIIM MM! 

Db 635 TCGGTGATGACCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGG 688 

Qy 256 GACCGGGTAGCCG GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

M II I I MM II III I III 

Db 689 GAGTTGCTGGAAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAG 748 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

I M Mill Mill Ml II I III I I I II 

Db 749 TCTGAGTACTATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCAC 808 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

MM I II M I I II I Ml III 

Db 809 AATGAACTGGCCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCC 868 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

III III MM I I I MM! II I 

Db 869 TCAGTGGAGAAAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCC 928 
Qy 481 A AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAA 519 

I II I II MM III! Ill 

Db 929 AACCCCAGCTCCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGAT 988 

Qy 520 TACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCG 579 

I II I lllllll II II III II I II I I I I 

Db 989 GAGCACATAGCCAAGCAGCGCTACATAGGTGGCAAGAATCTGCCCACAAGGGGCACCGCT 1048 

Qy 580 GAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCT 639 

II llllllll II MIIIMI I llllll IIIMIII I IIIMI I 

Db 1049 GAATGGCTGTCTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCC 1108

Qy 640 ATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGC 681

I MM I MM Mill llllllll 

Db 1109 AACTTGGGTCTGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGAC 1168 
Qy 682 AAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGC 729

III I I I II I M II I I II 

Db 1169 ATACTGGAAAATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGAT 1228 

Qy 730 CGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACC 789 

II II I II llllll I II II I I 

Db 1229 GACCATGGCCGTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATCACCACAACCCACAC 1288

Qy 790 CCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGATGAC 849

III II I II III I lllllll 

Db 1289 CTGATCCTCATGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGG 1348

Qy 850 GACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTG 909 

I I I lllllllll Mill Mlllllll M I MM IIIIIMIII II 

Db 1349 AAGAAGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTA 1408

Qy 910 CGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAG 969

M I II II llllllllllll MM II IIIIIMI III I II Mill III 

Db 1409 CGCCCCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAG 1468



Qy 970 GGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAG 1029

II III iiiiiii mill II II II II III II I III Mill! 

Db 1469 GGTTACTATGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACC 1528

Qy 1030 TACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGC 1089

I MM III II I II Mllll I Mill I Mill II MM III 

Db 1529 CATAGCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGC 1588 

Qy 1090 TGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAG 1149

IMM II MM MMMM Ml Mill MIIMI IMMI I I IMMI 

Db 1589 TGCGTCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAG 1648

Qy 1150 GTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

MMIMMMMMMMM I III II II llllllllllll 

Db 1649 GTGGAGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGA 1696
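
The Best Local Similarity percentage in each result header can be rechecked from the counts printed beside it. The following is a minimal Python sketch, assuming (the report does not state this) that the figure is the share of exactly matching columns among all alignment columns, that is matches, conservative substitutions, mismatches and indel positions. The function name is illustrative only.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are exact matches.

    Assumed definition: every column of the local alignment is counted
    once as a match, a conservative substitution, a mismatch or an indel.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts printed in the header of the 4382-residue database hit:
# Matches 652; Conservative 0; Mismatches 524; Indels 72
print(round(best_local_similarity(652, 0, 524, 72), 1))
```

Under that assumption the counts of the other results in this report give the printed percentages as well, which suggests the definition is at least consistent with the output.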



RESULT 13
US-09-906-158-3
; Sequence 3, Application US/09906158
; Publication No. US20030078217A1
; GENERAL INFORMATION:
; APPLICANT: Brett P. Monia
; APPLICANT: Susan M. Freier
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3
; EXPRESSION
; FILE REFERENCE: RTS-0257
; CURRENT APPLICATION NUMBER: US/09/906,158
; CURRENT FILING DATE: 2001-07-14
; NUMBER OF SEQ ID NOS: 168
; SEQ ID NO 3
; LENGTH: 2574
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (254)...(1492)
US-09-906-158-3

Query Match 18.7%; Score 224.4; DB 11; Length 2574; 

Best Local Similarity 53.3%; Pred. No. 9.2e-53; 

Matches 660; Conservative 0; Mismatches 506; Indels 72; Gaps 6; 

Qy 26 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 85 

III III I I III M Ml IMMI II I I II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 320

Qy 86 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 145 

IMIIIII MM Ml IIIIIII I III MM I I II MM 

Db 321 CTCTGTCCACTTGCACCACCTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 380

Qy 146 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 205 

II I II Mill II IMMI MM llllllllll I II II 

Db 381 TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCAACGGTGATGA 440

Qy 206 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 265 

I I II III I II Mill IIIIIMIIM Mill II I I II 



Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 494

Qy 266 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 310

I I II II MM I I MM MM MM 

Db 495 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554

Qy 311 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 370 

I MMI Mil I Ml! I I I II M 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGCGGAGCACAACGAACTGG 614

Qy 371 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 430

Ml I IIIIIIII IIIIII 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 431 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 490

M III M I I MM I M Ml I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 491 TGGAGCAGCACGTGGAG CTATACCAGAAATACAGCAATG 529

MM I I I II II I I I M II 

Db 735 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 794 

Qy 530 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 589

IIIIII III II III II I II I I I Mllllllll 

Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 854 

Qy 590 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 649

IMMMMMM I mm MMMM I MMM I I Ml 

Db 855 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 

Qy 650 TTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACA CACTCCACGTGG 703 

I MM MM IIIIIIII III III I 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 704 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 763

MM III II I I II III II III II 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 764 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGC 819 

I IIIIII IIIIII II I I I I I 

Db 1035 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCCTCA 1094 

Qy 820 TCCCGGCACCGCCGAGACTACAAGGATGACG ACGACAAGG 859 

III Mill I mil II II II II 

Db 1095 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 860 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 919 

I MM mil lllllllll II I MM IIIIIIIIM Mill I MM 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 920 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 979 

IIIIIIIIIIIII MM IIIMIIIMIIIII I IMIIIII lllllllll III 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 1274 

Qy 980 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1039 

MM IIIIII II II II II III II I II IIIIII IIIIII II 

Db 1275 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 



Qy 1040 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1099

I MM MIIIIMI I Mill I II II M III! Illllllllll I 

Db 1335 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 1394

Qy 1100 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1159 

III llllllll III Mill IIIIIM II II II Mill IIIMIIIII 

Db 1395 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 1454

Qy 1160 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I IIIIIIIII I III II II II II Mllll 
Db 1455 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 1492
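
The reported Score can apparently be reproduced from the counts in a result header. The sketch below is an inference rather than documented behaviour: it assumes +1 per match and a mismatch value of -0.6, chosen only because it reproduces the scores printed in this report, and it charges 10 per gap plus 1 per indel position, following the Gapop 10.0 and Gapext 1.0 parameters printed in the search header. The function name and parameter defaults are hypothetical.

```python
def alignment_score(matches, mismatches, indels, gaps,
                    match=1.0, mismatch=-0.6, gap_open=10.0, gap_ext=1.0):
    """Rebuild a local-alignment score from the header counts.

    Assumed scoring (inferred from the printed numbers, not documented):
      +1.0 per match, -0.6 per mismatch,
      gap_open per gap plus gap_ext per indel position.
    """
    return (matches * match
            + mismatches * mismatch
            - gaps * gap_open
            - indels * gap_ext)


# Header of the result above:
# Matches 660; Mismatches 506; Indels 72; Gaps 6; reported Score 224.4
print(round(alignment_score(660, 506, 72, 6), 1))
```

The same arithmetic also matches the scores of the neighbouring patent-database results, but headers that list conservative substitutions are not covered by this simple formula.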



RESULT 14
US-10-028-158-20
; Sequence 20, Application US/10028158
; Publication No. US20020110833A1
; GENERAL INFORMATION:
; APPLICANT: Caniggia, Isabella
; APPLICANT: Post, Martin
; APPLICANT: Lye, Stephen
; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF
; TITLE OF INVENTION: TROPHOBLAST
; FILE REFERENCE: 11757.38USWO
; CURRENT APPLICATION NUMBER: US/10/028,158
; CURRENT FILING DATE: 2001-12-20
; PRIOR APPLICATION NUMBER: US/09/380,662
; PRIOR FILING DATE: 1999-12-21
; PRIOR APPLICATION NUMBER: PCT/CA98/00180
; PRIOR FILING DATE: 1998-03-05
; PRIOR APPLICATION NUMBER: US 60/039,919
; PRIOR FILING DATE: 1997-03-07
; NUMBER OF SEQ ID NOS: 24
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 20
; LENGTH: 2574
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (254)...(1492)
US-10-028-158-20

Query Match 18.7%; Score 224.4; DB 13; Length 2574; 

Best Local Similarity 53.3%; Pred. No. 9.2e-53; 

Matches 660; Conservative 0; Mismatches 506; Indels 72; Gaps 6; 
Qy 26 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 85 

III III I I III II III Mllll III I II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 320

Qy 86 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 145 

llllllll MM Ml I III II I I Ml MM I I II MM 

Db 321 CTCTGTCCACTTGCACCACCTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 380 



Qy 146 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 205



II I II mil II MUM MM iiiiiiiiii I II II 

Db 381 TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCAACGGTGATGA 440

Qy 206 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 265 

II II I II I II Mill IIIIIMIIII Mill MINI 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 494

Qy 266 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 310 

II MM MM I I MM MM MM 

Db 495 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 311 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 370 

I MM! II II I III! I I I II II 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGCGGAGCACAACGAACTGG 614 

Qy 371 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 430 

III I I I I I III I I I I I I I 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 431 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 490

II III I I II I MM M III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 491 TGGAGCAGCACGTGGAG CTATACCAGAAATACAGCAATG 529

MM II III II I I I MM 

Db 735 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 794 

Qy 530 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 589 

llllll II I II Mini II II I IIIIIIIIM 

Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 854 

Qy 590 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 649

IMIMMMMI I MMM MMIIM I lIMM I I Ml 

Db 855 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 
Qy 650 TTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACA CACTCCACGTGG 703 

I MM MM I II MM I III III I 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 704 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 763 

MM MM II I II III M III II 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 
Qy 764 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGC 819 

I Ml I M I M III II I I I M 

Db 1035 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCCTCA 1094 

Qy 820 TCCCGGCACCGCCGAGACTACAAGGATGACG ACGACAAGG 859

III Mill I Mill II II II II 

Db 1095 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 860 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 919 

I MM Mill IIMIIIII II I MM IMIIIIMI IIMI I MM 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 920 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 979 

MIMMMMM MM lllllllllllllll I MMIIM IIMIIIII Ml 



Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 1274

Qy 980 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1039

INI IIIIM II II II II III II I II IIIIII IIIIII II 

Db 1275 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 

Qy 1040 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1099

I MM IIMIIIII I IIIII I II II II MM IIMIIIIMI I 

Db 1335 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 1394

Qy 1100 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1159 

III Mllllll III IIIII lllllll II II II IIIII llllllllll 
Db 1395 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 1454 

Qy 1160 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I IIIIIMII I III M II II II IIIIII 

Db 1455 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 1492



RESULT 15
US-09-906-158-10
; Sequence 10, Application US/09906158
; Publication No. US20030078217A1
; GENERAL INFORMATION:
; APPLICANT: Brett P. Monia
; APPLICANT: Susan M. Freier
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3
; EXPRESSION
; FILE REFERENCE: RTS-0257
; CURRENT APPLICATION NUMBER: US/09/906,158
; CURRENT FILING DATE: 2001-07-14
; NUMBER OF SEQ ID NOS: 168
; SEQ ID NO 10
; LENGTH: 2879
; TYPE: DNA
; ORGANISM: Mus musculus
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (611)...(1843)
US-09-906-158-10

Query Match 18.4%; Score 220.4; DB 11; Length 2879; 

Best Local Similarity 52.1%; Pred. No. 1.3e-51; 

Matches 645; Conservative 0; Mismatches 521; Indels 72; Gaps 4; 

Qy 26 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 85 

III III IIIIII III IIIIII II II 

Db 612 TGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACAATCAGCCTCT 671 

Qy 86 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 145 

llllllll MM II I III I I I I III MM I I II MM 

Db 672 CTCTGTCCACTTGCACCACGTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 731 

Qy 146 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 205 

II I M Mill II IIIIM MM llllllllll I M II 

Db 732 TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCATCGGTGATGA 791 



Qy 206 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 265

MM Mil M mil IMIIIIIIII Mill II III 

Db 792 CCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGGGAGTTGCTGG 845 

Qy 266 CCG GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACT 310 

I MM M III lllllll MM 

Db 846 AAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAGTCTGAGTACT 905 

Qy 311 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 370 

I Mill Ml M lllllll II MM 

Db 906 ATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCACAATGAACTGG 965

Qy 371 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 430

I MM I MM III M I I II 

Db 966 CCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCCTCAGTGGAGA 1025 
Qy 431 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCA 481 

Ml MM I II I MM M M 

Db 1026 AAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 1085 
Qy 482 AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATG 529

II Ml MM I I II I M I II I 

Db 1086 CCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGATGAGCACATAG 1145 

Qy 530 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 589

IIIIIM II II llllll III II III lllllll 

Db 1146 CCAAGCAGCGCTACATAGGTGGCAAGAATCTGCCCACAAGGGGCACCGCTGAATGGCTGT 1205

Qy 590 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 649

I M IIIIMM I llllll IIIIIIM I Mini 1 I MM 

Db 1206 CTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCCAACTTGGGTC 1265 
Qy 650 TTCGCCTCAGTGCCCACTCTTCCTCTGACAGC 681 

I MM Mill II II III I 

Db 1266 TGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGACATACTGGAAA 1325 

Qy 682 --AAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACC 739 

III I I I II Mill I I II MM 

Db 1326 ATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGATGACCATGGCC 1385 

Qy 740 TGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGA 799

I M llllll I M II I III I I 

Db 1386 GTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATCACCACAACCCACACCTGATCCTCA 1445

Qy 800 GGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGATGACGACGACAAGG 859

I lllllll lllllll I II II 

Db 1446 TGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGGAAGAAGAGGG 1505 

Qy 860 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 919

lllllll Mill lllllllll II I MM llllllllll II II I II I 
Db 1506 CCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTACGCCCCCTTT 1565 

Qy 920 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 979

I MIMIMIIII MM M IIIMIII III I M MM Mill III Ml 

Db 1566 ATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAGGGTTACTATG 1625 

Qy 980 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1039



Mil llllll II II II II III II 1 III III!!! I MM II 

Db 1626 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACCCATAGCACGG 1685

Qy 1040 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1099

III I II MUM I Mill I Mill II MM llllllll II I 

Db 1686 TGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGCTGCGTCCCCC 1745

Qy 1100 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1159 

Ml MIMMI Ml IIMI MMMI llllll I I MIIIIIIIIMMM 

Db 1746 AGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAGGTGGAGCAGC 1805

Qy 1160 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IMMIIIIII I Ml II II IIIIIIMIIII 

Db 1806 TGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGA 1843 



Search completed: October 28, 2003, 09:04:07
Job time: 362.927 secs

GenCore version 5.1.6
Copyright (c) 1993 - 2003 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         October 27, 2003, 10:47:27 ; Search time 2970.42 Seconds
                (without alignments)
                9794.056 Million cell updates/sec

Title:          US-10-017-372E-8
Perfect score:  1197
Sequence:       1 atggcgccttcggggctgcg gttcctgcaagtgcagctga 1197

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0
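
The "sw model" named in this header is Smith-Waterman local alignment. As a rough illustration of what such a search scores for each query/database pair, here is a minimal Python sketch of the affine-gap recurrences (Gotoh's formulation). It is not the GenCore implementation. The match and mismatch values (+1 and -0.6) are assumptions standing in for the IDENTITY_NUC table, and a gap of length L is assumed to cost Gapop + L x Gapext.

```python
def smith_waterman_score(query, subject, match=1.0, mismatch=-0.6,
                         gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gap penalties.

    Assumptions (not taken from the report): identity scoring of +1/-0.6
    and a gap of length L costing gap_open + L * gap_ext.
    """
    query, subject = query.upper(), subject.upper()
    neg_inf = float("-inf")
    cols = len(subject) + 1
    prev_h = [0.0] * cols        # best score of alignments ending at (i-1, j)
    prev_e = [neg_inf] * cols    # ... ending with a query residue against a gap
    best = 0.0
    for q in query:
        cur_h = [0.0] * cols
        cur_e = [neg_inf] * cols
        f = neg_inf              # ... ending with a subject residue against a gap
        for j in range(1, cols):
            # open a new gap or extend the existing one, whichever is better
            cur_e[j] = max(prev_h[j] - gap_open - gap_ext, prev_e[j] - gap_ext)
            f = max(cur_h[j - 1] - gap_open - gap_ext, f - gap_ext)
            diag = prev_h[j - 1] + (match if q == subject[j - 1] else mismatch)
            # local alignment: a score never drops below zero
            cur_h[j] = max(0.0, diag, cur_e[j], f)
            best = max(best, cur_h[j])
        prev_h, prev_e = cur_h, cur_e
    return best


print(smith_waterman_score("ACGTACGT", "ACGTACGT"))
```

The sketch keeps only two rows of the matrix and returns the score alone; recovering the alignment itself, as printed in the ALIGNMENTS section, would additionally need a traceback.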



Searched: 22781392 seqs, 12152238056 residues 
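
The throughput figure in this header appears to follow from the sizes printed with it. A rough check in Python, under the assumption (not stated in the report) that one cell update is one query residue compared with one database residue and that both strands of every database sequence are searched, which would also explain a hit count of twice the number of sequences:

```python
QUERY_LENGTH = 1197            # "Perfect score" / query length from the header
DB_RESIDUES = 12_152_238_056   # "Searched: ... residues"
SEARCH_SECONDS = 2970.42       # "Search time ... Seconds"
STRANDS = 2                    # assumption: forward and complement strands


def million_cell_updates_per_sec(query_len, db_residues, seconds,
                                 strands=STRANDS):
    """Dynamic-programming cells filled per second, in millions."""
    return strands * query_len * db_residues / seconds / 1e6


rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
print(f"{rate:.3f} million cell updates/sec")
```

The computed value lands within a few thousandths of the 9794.056 printed in the header; the small difference is what one would expect from the search time being rounded to two decimals.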

Total number of hits satisfying chosen parameters: 45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      EST:*
                1:  em_estba:*
                2:  em_esthum:*
                3:  em_estin:*
                4:  em_estmu:*
                5:  em_estov:*
                6:  em_estpl:*
                7:  em_estro:*
                8:  em_htc:*
                9:  gb_est1:*
                10: gb_est2:*
                11: gb_htc:*
                12: gb_est3:*
                13: gb_est4:*
                14: gb_est5:*
                15: em_estfun:*
                16: em_estom:*
                17: em_gss_hum:*
                18: em_gss_inv:*
                19: em_gss_pln:*
                20: em_gss_vrt:*
                21: em_gss_fun:*
                22: em_gss_mam:*
                23: em_gss_mus:*
                24: em_gss_pro:*
                25: em_gss_rod:*
                26: em_gss_phg:*
                27: em_gss_vrl:*
                28: gb_gss1:*
                29: gb_gss2:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                    %
  Result          Query
     No.   Score  Match  Length  DB  ID        Description
  ------------------------------------------------------------------
       1     729   60.9    1072  12  BM562135  BM562135 AGENCOURT
c      2   692.2   57.8    1201  13  BX355682  BX355682 BX355682
c      3   685.8   57.3     983  13  BX335351  BX335351 BX335351
       4   682.4   57.0     900  13  BX349319  BX349319 BX349319
c      5   629.6   52.6    1041  13  BX324511  BX324511 BX324511
       6   629.4   52.6    1043  12  BM555996  BM555996 AGENCOURT
c      7   603.6   50.4     950  13  BX383773  BX383773 BX383773
c      8     589   49.2     888  13  BX434425  BX434425 BX434425
       9   577.2   48.2     859   9  AL530081  AL530081 AL530081
      10     569   47.5     713  10  BE312000  BE312000 601154768
      11   561.2   46.9     717  10  BE260971  BE260971 601153715
      12     556   46.4     902  13  BQ675698  BQ675698 AGENCOURT
      13   534.2   44.6     925  12  BI818841  BI818841 603037307
      14   521.2   43.5    1093  12  BQ054305  BQ054305 AGENCOURT
c     15   520.2   43.5     841   9  AL530080  AL530080 AL530080
c     16     520   43.4     956  12  BI084718  BI084718 602869722
      17     514   42.9    1049  12  BQ053417  BQ053417 AGENCOURT
      18   513.6   42.9     871  13  BQ952138  BQ952138 AGENCOURT
      19   506.4   42.3    1013  12  BM469326  BM469326 AGENCOURT
      20   504.6   42.2     943  12  BI909079  BI909079 603070060
      21   501.8   41.9     821  10  BG748049  BG748049 602705544
      22   491.8   41.1     773  12  BI195242  BI195242 602944518
c     23     488   40.8    1047  13  BX325288  BX325288 BX325288
      24   486.8   40.7     637  12  BG938856  BG938856 cn28g07.y
      25   482.8   40.3     748  10  BF682938  BF682938 602117568
c     26   482.2   40.3     757  14  CA309731  CA309731 UI-H-FT1-
      27   474.6   39.6     659  10  BG500488  BG500488 602544826
      28   472.8   39.5     675  10  BE261764  BE261764 601147839
      29   464.2   38.8     960  13  BU957001  BU957001 AGENCOURT
      30   460.2   38.4     599  12  BI905963  BI905963 603062849
c     31   455.2   38.0     727   9  AI131171  AI131171 qc15c11.x
      32   454.2   37.9     814  12  BI686959  BI686959 603313837
      33   453.4   37.9     633  14  CB577186  CB577186 AMGNNUC:N
c     34   453.2   37.9     785   9  AI148173  AI148173 qb56d01.x
      35   452.2   37.8     548  12  BM068788  BM068788 ie85a02.y
      36   449.8   37.6     982  13  BQ672677  BQ672677 AGENCOURT
      37   447.4   37.4     981  12  BM802192  BM802192 AGENCOURT
      38   445.8   37.2     588  13  BX097979  BX097979 BX097979
c     39   445.6   37.2     928  13  BX412434  BX412434 BX412434
      40   444.8   37.2     565  14  CD287835  CD287835 l_M15,abd
      41   442.4   37.0     636  10  BE615330  BE615330 601280728
c     42   439.6   36.7     722  13  BU632686  BU632686 UI-H-FE1-
c     43   437.6   36.6     695  14  CA425775  CA425775 UI-H-FE1-
c     44   436.6   36.5     700  13  BU633922  BU633922 UI-H-FL1-
c     45   436.4   36.5    1015  10  BG683840  BG683840 602651694
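
The % Query Match column of the summaries is consistent with each hit's score expressed as a percentage of the perfect score (1197, the score of the query aligned to itself). A small Python sketch of that relationship, offered as an interpretation of the listing rather than a documented formula; the function name is illustrative:

```python
PERFECT_SCORE = 1197  # "Perfect score" printed in the search header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score of a hit as a percentage of the query's self-alignment score."""
    return round(100.0 * score / perfect_score, 1)


# (result number, score) pairs taken from the summaries
for result_no, score in [(1, 729), (2, 692.2), (45, 436.4)]:
    print(result_no, query_match_percent(score))
```

A check of this kind is also useful for catching digits that were misread when a scanned listing is transcribed, since Score and % Query Match must agree on every row.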



ALIGNMENTS 



RESULT 1
BM562135
LOCUS       BM562135                1072 bp    mRNA    linear   EST 20-FEB-2002
DEFINITION  AGENCOURT_6562032 NIH_MGC_118 Homo sapiens cDNA clone IMAGE:5745463
            5', mRNA sequence.
ACCESSION   BM562135
VERSION     BM562135.1  GI:18807966
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1072)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM12768 row: h column: 08
            High quality sequence start: 9
            High quality sequence stop: 647.
FEATURES             Location/Qualifiers
     source          1..1072
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5745463"
                     /tissue_type="leukocyte"
                     /lab_host="DH10B"
                     /clone_lib="NIH_MGC_118"
                     /note="Vector: pCMV-SPORT6; Site_1: NotI; Site_2: EcoRV
                     (destroyed); RNA source leukocytes from anonymous pool of
                     non-activated adult donors. Library is oligo-dT primed
                     and directionally cloned (EcoRV site is destroyed upon
                     cloning). Average insert size 1.7 kb, insert size range
                     1.2-3.3 kb. Library is normalized and enriched for
                     full-length clones and was constructed by C. Gruber
                     (Invitrogen). Research Genetics tracking code 027, Note:
                     this is a NIH_MGC Library."
BASE COUNT      217 a    356 c    313 g    186 t
ORIGIN



Query Match 60.9%; Score 729; DB 12; Length 1072;

Best Local Similarity 83.6%; Pred. No. 1.7e-155;

Matches 885; Conservative 0; Mismatches 145; Indels 29; Gaps 4;



Qy 117 GCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCT 176

llllllllllllllllllllll lllllll IMIIIMIII lllllllllll Mill

Db 20 GCTGGTGAAGCGGAAGCGCATCTAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCT 79

Qy 177 TGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGC 236

IIIIIIIIIIIIIIIIMIIIII llllllllllllllllllll lllll II M II

Db 80 CGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGC 139

Qy 237 TCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCC 296

II IMIIIM IIIIMIIIIIMI llllllll MM IIIIIMIIIMIIIII

Db 140 CCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCC 199

Qy 297 AGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCA 356

lllll IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM I MM I

Db 200 TGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGA 259

Qy 357 AATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGA 416

IMIIIIII II llllll I I Mini IMIIIM I llllllll II II

Db 260 AATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGA 319

Qy 417 GCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAG 476

MMM IMIIMI M Mill M IMIIIM MMMMIIMM IMIIIM

Db 320 GCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAG 379

Qy 477 GCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTG 536

IIIIIIIIIIIIIMMIIIIIIIIIIIIIIII IIIMIIIIMIIIIII lllllll

Db 380 GCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTG 439

Qy 537 GCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGA 596

III lllllllllllllllllllllll lllll lllll II MMM I II Mill

Db 440 GCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGA 499

Qy 597 TGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCT 656

Db 500

Qy 657 CAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTT 716

M II I II lllllll MM llllllll lllllllllll II lllll II llllllll

Db 560 TAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTT 619



Qy 717 CAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCT 776 

II I I llllllll llllllllllllllllllll IIIIIIIIIIIIM llllllll 
Db 620 CACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCT 679



Qy 777 CCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGA 836 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIII nil! IIIIIIIIMIIIIIII 

Db 680 TCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCG--- 736 



Qy 837 CTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAA 896

II lllllll IMMIII llllllllllll Mlllll 

Db 737 AGCCCCTGGACACCAACTATTGCTTCAGCTCCCCGGAGAA 776 



Qy 897 GAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGAT 956 

IIMIIIIIIIIIIIIIIIII llllllllllllll I llllll llllllllllllll 
Db 777 GAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAGGGACCTCGGCTGGAAGTGGAT 836

Qy 957 TCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAG 1016 

II II III I MIIIIMIIII I IMMIII llllllll llllllll Mill 

Db 837 CCACGAGCCCCAAGGCTACCATGCCCACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAG 896 



Qy 1017 CCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTC 1076 

III Mill I lllllll llllll lllllll II II MM II I II 

Db 897 CCTGGACACGCGTACCAGCAAGTCCCTGGCCCTGTACACCCGCCATAACCGGGCGCCCTC 956

Qy 1077 GGCGGCGCCGTGCTGC--GTGCCGCAGGCGCTGGAGCCACTGCCCAT--CGTGTACTACG 1132 

IMIIIII I Ml I II II Mill II llllll I I lllllll 
Db 957 GGCGGCGCGGGTTTGCTGGGCCCCAGGCCCCTGGACCCCCTGCCCCTTCGGGGTACTACC 1016 

Qy 1133 TGGGCCGC--AAGCCCAAGGTGGAGCAGCTGTCCAACAT 1169

MM III I llllll Ml II llllll MM 

Db 1017 TGGGGCGCCAAGCCCCAAGTGGGAACACCTGTCCCACAT 1055 



RESULT 2
BX355682/C
LOCUS       BX355682                1201 bp    mRNA    linear   EST 05-MAY-2003
DEFINITION  BX355682 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA
            clone CS0DI002YJ05 3-PRIME, mRNA sequence.
ACCESSION   BX355682
VERSION     BX355682.1  GI:30371987
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1201)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     Contact: Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 9160.r. For
            more information about this cluster, see
            http://www.genoscope.cns.fr/
            cgi-bin/cluster.cgi?seq=CS0DI002CE03NP1&cluster=9160.r. Contact:
            Feng Liang Email: fliang@lifetech.com URL:
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID: CS0DI002CE03NP1.
FEATURES             Location/Qualifiers
     source          1..1201
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DI002YJ05"
                     /tissue_type="PLACENTA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a Not I-oligo(dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT      198 a    326 c    386 g    253 t     38 others
ORIGIN

Query Match 57.8%; Score 692.2; DB 13; Length 1201; 

Best Local Similarity 86.7%; Pred. No. 4.2e-147; 

Matches 824; Conservative 4; Mismatches 93; Indels 29; Gaps 5; 

Qy 250 ACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG-GCGGACTA 308 

I llllllllllll mill I IIIIIIIIIIIIIMM III Ihllll 

Db 1046 AACCGCGACCGGGTGGCCGGG--AGAKKCAGAACCGGAGCCCGAGCCTGAGCCCGRACTA 989

Qy 309 CTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAA 368 

llllllllllllllhllllllllllMIIIIIIIMI I MM llllllllll II 

Db 988 CTACGCCAAGGAGGTMACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAA 929

Qy 369 ATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGC 428 

llllll I I llllll IMIIIII I IIIIIIM II llllllll Mill 

Db 928 GTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGC 869 

Qy 429 GGTGCCGGAACCTGTATTGCTCT-CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAA 487

III M Mill II IMIIII I IIIMIIIIIIMI IIMMIIIIIMMIMI 

Db 868 GGTACCTGAACCCGTGTTGCTCTCCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAA 809

Qy 488 AAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCA 547 

MIIIIIIIMIIIIIIIIIM IIIMIIIIIIMMII IIIMIIIII lllllll 

Db 808 AAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCA 749

Qy 548 GCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAG 607 

MMMMMMMM Mill IIIM II llllll I II Mllllllllllllll 

Db 748 GCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAG 689

Qy 608 TTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 667 

MMMIMMIM Ml III III M M II II llllllll II lllllll 

Db 688 TTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACT 630 

Qy 668 CTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCC 727 

MM MIIMII IIMIIIIIII II Mill II IMMIIIM I I MM 

Db 629 GCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCC 570



Qy 728 GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 787

Mil IIIIIIIIMIIIIIIIIII llllllllllllll llllllll llllllllll

Db 569 GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 510

Qy 788 CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGATG 847

IIIIIIIIIIIIMIMIIIII Mill IMIIIIIIIIIIIIIII

Db 509 CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 463

Qy 848 ACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCG 907

IIIIMM IMIIIII MMIIIIIIMIIIIMIII Mlllllll

Db 462 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGNACTGCTGCG 414

Qy 908 TGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCA 967

IIIIMIMI IIMMMIIIIM IMMMI IIIIIIIIMIIII M II MM

Db 413 TGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCA 354

Qy 968 AGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTC 1027

MIIIIIMIIIIIII MMIMI llllllll IIIIMM llllllll Mill 1

Db 353 AGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGC 294

Qy 1028 AGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGT 1087

IIIIIIIIIIMIIIIIM llllllllllllll lllllllllll IIIIIIMIIIM

Db 293

Qy 1088 GCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCA 1147

IIIIIIIIIIIIIIIMIMIIIM IIIIIIIIIIIIIIIIIIIIIIIIIMMIIIII

Db 233 GCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCA 174

Qy 1148 AGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197

IIIIIIIIIIIIIIIIIIIIIIMIIIIIII llllllllllllllllll

Db 173 AGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 124





RESULT 3 

BX335351/C 

LOCUS       BX335351 983 bp mRNA linear EST 01-MAY-2003
DEFINITION  BX335351 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA
            clone CS0DI013YH16 3-PRIME, mRNA sequence.
ACCESSION   BX335351
VERSION     BX335351.1 GI:30308367
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 983)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     Contact : Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 9160.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/cgi-bin/cluster.cgi?seq=CS0DI013DD08NP1&cluster=9160.r.
            Contact : Feng Liang Email : fliang@lifetech.com
            URL : http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0DI013DD08NP1.
FEATURES             Location/Qualifiers
     source          1..983
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DI013YH16"
                     /tissue_type="PLACENTA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo (dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  170 a 280 c 323 g 203 t 7 others
ORIGIN



Query Match 57.3%; Score 685.8; DB 13; Length 983;

Best Local Similarity 86.6%; Pred. No. 1.1e-145;

Matches 805; Conservative 5; Mismatches 93; Indels 27; Gaps 4;

Qy 268 GGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACC 327

Mill INI IIIIIIIIIMIIIIII Mill IIIIIMIIIIIIIIMIIIIIII

Db 983 GGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCACC 924



Qy 328 CGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCAC 387

IMIIMIIIMMMhl I MM MMMMM M MUM I I III

Db 923 CGCGTGCTAATGGTGGARACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACACAC 864

Qy 388 AGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTG 447

III IIIIIIM I IIIIIIM II llllllll IIIIIMI II Mill II III

Db 863 AGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTG 804



Qy 448 CTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 507

Mill llllllllllllll IIIMIMIIIIIMMIIIIIIIIIIIIIIIIIIIIII

Db 803 CTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 744

Qy 508 CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 567

II IMMIMMIIIIIII IhlllllM IMIIIIIIIIMIIIIIIIIII III

Db 743 CTGTACCAGAAATACAGCAACAATYCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 684

Qy 568 AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 627

II Mill M llllll I II IIIIIIIIIIIIIIMMIIIIIIIIIIII III I

Db 683 AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 624

Qy 628 CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGAT 687

II III M M II II llllllll II lllllll MM llllllll III

Db 623 CGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 565

Qy 688 AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 747

llllllll M Mill M MMMMM I I IIIIIIM IIMIIIIIIMMI

Db 564 AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 505

Qy 748 ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 807

Mill IIIMMMIIIII Mllllll IIMIMMIIIIIIIMIIIIIIIMIII

Db 504 ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 445



Qy 808 CACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGATGACGACGACAAGGCCCTGGAT 867

II lllll IIIIIIIIIIIIMIIM llllllll

Db 444 CATCTGCAAAGCTCCCGGCACCGCCGA------------------------GCCCTGGAC 409

Qy 868 ACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGAC 927

IIIIMM IIIIIIIIMIMMIIMMMINMIIIIIMIIIIM MIIIMM

Db 408 ACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGAC 349

Qy 928 TTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTC 987

lllll llllllll llllllllllllll II II llllllllllllllllllll III

Db 348 TTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTC 289

Qy 988 TGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCT 1047

IIIM llllllll llllllll MMIMI lllll llllh: llllllllllll

Db 288 TGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTAMMMCAAGGTCCTGGCC 229

Qy 1048 CTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTG 1107

OIMIIMIIMI lllll Mill IIIIMIIMIIl I IIIIIMIIMIIIM

Db 228 MTGTACAACCAGG^TAACCC-GGCGCCTCGGCGGCGCCGTNGT-CGTGCCGCAGGCGCTG 171

Qy 1108 GAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 1167

lllll IIIIMIIIMIIMIIMIIIII IIIIIIIIIMIIIIIIIIIIIIIIIIII

Db 170 GAGCCGCTGCCCATCGTGTACTACGTGGGCTGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 111

Qy 1168 ATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197

IIIMIIIIII MIIIIIIIIIMIIMI

Db 110 ATGATCGTGCGCTCCTGCAAGTGCAGCTGA 81



RESULT 4 
BX349319 
LOCUS       BX349319 900 bp mRNA linear EST 05-MAY-2003
DEFINITION  BX349319 Homo sapiens B CELLS (RAMOS CELL LINE) COT 25-NORMALIZED
            Homo sapiens cDNA clone CS0DL010YL07 5-PRIME, mRNA sequence.
ACCESSION   BX349319
VERSION     BX349319.1 GI:30379410
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 900)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     Contact : Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 9160.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/cgi-bin/cluster.cgi?seq=CS0BAG059ZD04_CS05596_1&cluster=9160.r.
            Contact : Feng Liang Email : fliang@lifetech.com URL :
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0BAG059ZD04_CS05596_1.
FEATURES             Location/Qualifiers
     source          1..900
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DL010YL07"
                     /cell_type="B CELLS (RAMOS CELL LINE) COT 25-NORMALIZED"
                     /cell_line="RAMOS CELL LINE"
                     /clone_lib="Homo sapiens B CELLS (RAMOS CELL LINE) COT
                     25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo (dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  179 a 292 c 270 g 155 t 4 others
ORIGIN

Query Match 57.0%; Score 682.4; DB 13; Length 900; 

Best Local Similarity 88.4%; Pred. No. 6.4e-145; 

Matches 740; Conservative 0; Mismatches 97; Indels 0; Gaps 0;

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M MMIIIIIM IMIIIIIIIIII IIIMIMIIIIII II III 

Db 35 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 94 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

MIMMMMMMMIMMI Mill IMMIIIIIIIII MMIIIIIIMIM 

Db 95 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 154 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

IIIIIIIIIIIIMIIIIIIIIMII lllllllllll lllllllllll Mill III 

Db 155 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 214 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IMMIIIIIIIIIIMIII MIIIIMIIIMMIIIM IIIM II II II M 

Db 215 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 274 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

llllllll IMIIIMMMII llllllll MM IIIIIIIIMIIIIIM III 

Db 275 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 334 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IMMMMMMMMMMIMIMMMMMIMIMM I MM Mill 

Db 335 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 394

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II llllll I I MUM IIIIMII I IIIIIIM II llllll 

Db 395 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 454

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480

M llllllll II Mill II llllllll llllllllllllll llllllllllll 
Db 455 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 514 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 540

IIIIMIIIIIMIIIMMIIMIMIMMMMMIIIMIM MIIIMIII 

Db 515 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 574 



Qy 


541 


Db 


575 


Qy 


601 


Db 


635 


Qy 


661 


Db 


695 


Qy 


721 


Db 


755 


Qy 


781 


Db 


815 



lllllllllllllllllllll HIM Mill II IMMI I II Mlllllll 



I Mill MMMM 



IMMI MM MMMM MIIMMIII II HIM M IIIMIIIII I 



III 



Mllllllll 



Ml 



III 



MMMMMII II HIM Ml 



MM 



II 



II 



II 



I I 



RESULT 5 

BX324511/C 

LOCUS       BX324511 1041 bp mRNA linear EST 02-MAY-2003
DEFINITION  BX324511 Homo sapiens NEUROBLASTOMA COT 25-NORMALIZED Homo sapiens
            cDNA clone CS0DC024YD20 3-PRIME, mRNA sequence.
ACCESSION   BX324511
VERSION     BX324511.1 GI:30332381
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 1041)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     Contact : Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 9160.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/cgi-bin/cluster.cgi?seq=CS0AC024DB10NP2&cluster=9160.r.
            Contact : Feng Liang Email : fliang@lifetech.com URL :
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0AC024DB10NP2.
FEATURES             Location/Qualifiers
     source          1..1041
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DC024YD20"
                     /tissue_type="NEUROBLASTOMA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens NEUROBLASTOMA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo (dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  176 a 283 c 346 g 219 t 17 others
ORIGIN

Query Match 52.6%; Score 629.6; DB 13; Length 1041;

Best Local Similarity 84.7%; Pred. No. 7.1e-133;

Matches 788; Conservative 8; Mismatches 102; Indels 32; Gaps 7; 

Qy 269 GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCC 328 

:|! 1 ^1 lilllllllMlll II :h h II I h lllllll 

Db 1022 SGGGAGAKGSAGAACCGGAGCCCGACCCTRASCCR---ACTAyACCCCAAGRAGTCACCC 966 

Qy 329 GCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCACA 388

MMhIlllhl Mil I III! IIIIIIIIM II mill I I INI 

Db 965 GCGTGYTAATGKT-GAAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACACACA 907

Qy 389 GCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGC 448 

II Illlllll I Illlllll II Illlllll IIMIMI II lllll II MM 

Db 906 GCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTGC 847 

Qy 449 TCTCTC-GGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 507

Mil I lllllllllllll IIIIIIIIMMMIIIIIIMMMIMMIIIIIIII 

Db 846 TCTCCCSGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 787 

Qy 508 CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 567 

II IIIIIIIIIIIIIMII Mllllllll lllllllllllllllllllllll III 

Db 786 CTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 727 

Qy 568 AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 627 

II lllll II mill I M iiiiiiiimiiiiiiiiimmiiii m i 

Db 726 AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 667

Qy 628 CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGAT 687

II III II II II II imilll II lllllll MM Illlllll III 
Db 666 CGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 608 

Qy 688 AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 747 

Illlllll II lllll II iiiiiiiiii I I Illlllll iiiiiiMiiiim 

Db 607 AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 548

Qy 748 ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 807 

lllll IIIIIIIIIIIMI Illlllll IIIIIIIIIIIIIIIIIIIIIIIMMIII 
Db 547 ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 488 

Qy 808 CACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGATGACGACGACAAGGCCCTGGAT 867

M lllll llllllllllllllllll Mllllll
Db 487 CATCTGCAAAGCTCCCGGCACCGCCGA------------------------GCCCTGGAC 452

Qy 868 ACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGAC 927 

Illlllll IIIIIIIIIIIIIMIIIIIIIIIIMIMIIIIIIIIMI IIIIMMI 

Db 451 ACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGAC 392



Qy 928 TTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTC 987

Mill llllllll llilllllllllll II II llllilllllllllllllll III

Db 391 TTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTC 332

Qy 988 TGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCT 1047

Mill llllllll llllllll llllllll Mill IIIIIIIIIIMIIllllll

Db 331 TGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCC 272

Qy 1048 CTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTG 1107

llilllllllllll Mill 1 1 1 1 1 llilllllllllll 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1

Db 271 CTGTACAACCAGCATAACCC-GGCGCCTCGGCGGCGCCGTG-TGCGTGCCGCAGGCGCTG 214

Qy 1108 GAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 1167

Mill MIMMMMIIIMMIIIIMMIIIIMIMIIIIMIIMMIIMMI

Db 213 GAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 154

Qy 1168 ATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197

IIIIIIIIIM llllllllllllllllll

Db 153 ATGATCGTGCGCTCCTGCAAGTGCAGCTGA 124





RESULT 6 
BM555996 

LOCUS       BM555996 1043 bp mRNA linear EST 20-FEB-2002
DEFINITION  AGENCOURT_6544437 NIH_MGC_88 Homo sapiens cDNA clone IMAGE:5550039
            5', mRNA sequence.
ACCESSION   BM555996
VERSION     BM555996.1 GI:18796907
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 1043)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM12261 row: i column: 16
            High quality sequence stop: 702.
FEATURES             Location/Qualifiers
     source          1..1043
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5550039"
                     /tissue_type="duodenal adenocarcinoma, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_88"
                     /note="Organ: small intestine; Vector: pCMV-SPORT6;
                     Site_1: NotI; Site_2: SalI; Cloned unidirectionally;
                     oligo-dT primed. Average insert size 1.767 kb. Library
                     enriched for full-length clones and constructed by Life
                     Technologies. Note: this is a NIH_MGC Library."
BASE COUNT  202 a 329 c 345 g 154 t 13 others
ORIGIN



Query Match 52.6%; Score 629.4; DB 12; Length 1043;

Best Local Similarity 88.1%; Pred. No. 7.9e-133;

Matches 706; Conservative 0; Mismatches 71; Indels 24; Gaps 1;

Qy 386 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 445

iiiM iiiMiii I IIIIIIII II IIIIIIII IIIIIIII II mil II I

Db 1 ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 60



Qy 446 TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 505

IIMIII llllllllllllll IIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIII

Db 61 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 120

Qy 506 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 565

III! Illllllllllllllll IIIIIIIIII lllllllllllllllllllllll I

Db 121 AGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 180



Qy 566 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 625

INI Mill II llllll 1 II IIIIMIIIIIIIIIIIIIIIIIIIIIMI III

Db 181 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 240

Qy 626 CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAG 685

III III II M Mill IIIIIIII II IIIMII MM IIIIIIII 1

Db 241 GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 300

Qy 686 ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 745

IIIIIIIIII II Mill II IIIIIIIIII 1 1 IMIIIII IIIIIIIIIIIII

Db 301 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 360

Qy 746 CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 805

IIIMII IIIIIIIIMIIII Mllllll IIIIIIIIIMIIIIIIIIIIIIIIMI

Db 361 CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 420

Qy 806 AGCACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGATGACGACGACAAGGCCCTGG 865

MM IIIM llllllllllllllllll IIIMII

Db 421 AGCATCTGCAAAGCTCCCGGCACCGCCGA------------------------GCCCTGG 456

Qy 866 ATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTG 925

1 IIIIIIII IIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIM Illllll

Db 457 ACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTG 516

Qy 926 ACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATT 985

Illllll IIIIIIM llllllllllllll II II llllllllllllllllllll 1

Db 517 ACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACT 576

Qy 986 TCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGG 1045

MMIII IIIMIII IIMMM IIIIIIII Mill IIIIMIIIIIIMIIIM

Db 577 TCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGG 636


Qy 1046 CTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGC 1105

1 IMIIIIIIIIIII lllllllllll IIIIIIIIIIIIIIIIIIIIIIIMIIIIM



Db 637 CCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGC 696 

Qy 1106 TGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 1165

IMIIII IIIMIIIIMIIIIIMIIIIIIIIMIIIIIIMIIIIIMIIIIIIMI 

Db 697 TGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 756 

Qy 1166 ACATGATCGTGCGTTCCTGCA 1186

lllllllllllll Mlllll 
Db 757 ACATGATCGTGCGCTCCTGCA 777 



RESULT 7 

BX383773/C 

LOCUS       BX383773 950 bp mRNA linear EST 08-MAY-2003
DEFINITION  BX383773 Homo sapiens HELA CELLS COT 25-NORMALIZED Homo sapiens
            cDNA clone CS0DK001YA15 3-PRIME, mRNA sequence.
ACCESSION   BX383773
VERSION     BX383773.1 GI:30457168
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 950)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     Contact : Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 9160.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/cgi-bin/cluster.cgi?seq=CS0DK001AA08NP1&cluster=9160.r.
            Contact : Feng Liang Email : fliang@lifetech.com URL :
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0DK001AA08NP1.
FEATURES             Location/Qualifiers
     source          1..950
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DK001YA15"
                     /cell_type="HELA CELLS COT 25-NORMALIZED"
                     /cell_line="HELA"
                     /clone_lib="Homo sapiens HELA CELLS COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo (dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  171 a 260 c 312 g 195 t 12 others
ORIGIN



Query Match 50.4%; Score 603.6; DB 13; Length 950;

Best Local Similarity 83.9%; Pred. No. 5.7e-127;

Matches 717; Conservative 8; Mismatches 104; Indels 26; Gaps 3;



Qy 343 GAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTG 402

hill INI III M Ml! Ml llllll 1 1 llllll llllll M 1

Db 943 GRAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGMACACACAGCATATATATGTTC 884


Qy 403 TTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAG 462

II II hi 1 II llllllll llllllll II |:|ll II llllllll Mil II III

Db 883 TTCAAMACATCAGAGCTCCGAGAAGCGGTACCTGVACCCGTGTTGCTCTCCCGGGCAGAG 824

Qy 463 CTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATAC 522

Mill IIIIIIIMMIIIIIIIIIIIIIMIMIIIIIIIIIIII IIMIIIIIIII

Db 823 CTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC 764

Qy 523 AGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAG 582

IIIII II :|||IM IIMIIIIMIIIIMIIIIMI IIIII IIIII M III

Db 763 AGCAACAATAMCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAG 704

Qy 583 TGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATA 642

Ml 1 M llllllllllllllllllllllllllllll III III III II II

Db 703 TGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATT 644

Qy 643 GAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTG 702

II II Mllllll II lllllll MM 1 llllll III Mlllll II III

Db 643 GA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTTACAGCAGGGATTACACACTGCAAGTG 585

Qy 703 GAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAC 762

II II Mllllll 1 1 1 Mllllll IIIIIIIIIIMIIIIMII lllllllll

Db 584 GACATCAACGGGTTTACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAAC 525

Qy 763 CGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCC 822

Mill llllllll llllllllllllllllllllllllllllllll Mill llllll

Db 524 CGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCC 465

Qy 823 CGGCACCGCCGAGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTC 882

IIIIIIIIIMI Ihlllll llllllll llllll

Db 464 CGGCACCGCCGA------------------------GCMCTGGACACCAACTATTGCTTC 429

Qy 883 AGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTG 942

MMMMMMMMIMMMMMMMMM lllilhlllllll llllllll

Db 428 AGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTKACTTCCGCAAGGACCTC 369

Qy 943 GGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGT 1002

MIMIMIMIM II II IIIIMIIIIIIIIIIII 1 llllllll llllllll

Db 368 GGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCGACTTCTGCCTCGGGCCCTGC 309


Qy 1003 CCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCAC 1062

llllllll llllllll IIIII llllllllllllllllllll 1 1 1

Db 308 CCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCX3CCTGTACAACCAGCA 249


Qy 1063 AACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATC 1122

1 II Mill llllllllllllll lllllllllllllllllllllll lllllllll

Db 248 TAACCCGGCGCCTCGGCGGCGCCGTG-TGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATC 190

Qy 1123 GTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCC 1182

IIIIIMIMIMII MIIIIMIIIIIIIIIIIIIMIII IMIIIIMIIII III

Db 189 GTGTACTACGTGGGCKKCAAGCCCAAGGTGGAGCAGCTGTCCNACATGATCGTGCGCTCC 130



Qy 1183 TGCAAGTGCAGCTGA 1197 

IIIIIMIMIIIM 
Db 129 TGCAAGTGCAGCTGA 115 



RESULT 8 

BX434425/C 

LOCUS       BX434425 888 bp mRNA linear EST 15-MAY-2003
DEFINITION  BX434425 Homo sapiens PLACENTA Homo sapiens cDNA clone CS0DE014YE16
            3-PRIME, mRNA sequence.
ACCESSION   BX434425
VERSION     BX434425.1 GI:30779291
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 888)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     Contact: Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 9160.r For
            more information about this cluster, see
            http://www.genoscope.cns.fr/cgi-bin/cluster.cgi?seq=CS0BAK028AB08NM1&cluster=9160.r.
            Contact : Feng Liang Email : fliang@lifetech.com URL :
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0BAK028AB08NM1.
FEATURES             Location/Qualifiers
     source          1..888
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DE014YE16"
                     /tissue_type="PLACENTA"
                     /clone_lib="Homo sapiens PLACENTA"
                     /note="Vector: pCMVSPORT_6; 1st strand cDNA was primed
                     with a NotI-oligo (dT) primer. Five prime end enriched,
                     double-strand cDNA was digested with Not I and cloned into
                     the Not I and EcoRV sites of the pCMVSPORT 6 vector.
                     Library was not normalized."
BASE COUNT  156 a 253 c 302 g 176 t 1 others
ORIGIN



Query Match 49.2%; Score 589; DB 13; Length 888; 

Best Local Similarity 87.1%; Pred. No. 1.2e-123;

Matches 679; Conservative 0; Mismatches 76; Indels 25; Gaps 2; 
Qy 418 CTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGG 477 

MM IIMIII M Mill I III I IIMIIIIIIIIII lllllll I 

Db 888 CTCCAAAAAGCGGTACCTGAACCCTTGTTGTCTNCCCGGGCAGAGCTGCGTCTGCTGA-G 830 



Qy 478 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 537

llllllll III lllllllllllllllllll lllllllllllllllll llllllll 
Db 829 CTCAAGTTTAAAATGGAGCAGCACGTGGAGCTGTACCAGAAATACAGC^ 770 

Qy 538 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 597 

II MIIIIMMIIIIIIMIMII I Ml Mill M IMMI I II MUM 

Db 769 CGATACCTCAGCAACCGGCTGCTGGCACACAGCGACTCGCCAGAGTGGTTATCTTTTGAT 710 

Qy 598 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 657 

IIIIIIMIIIMMMMMMI Ml Ml III II M Mill IIIMMl 

Db 709 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 650 

Qy 658 AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 717 

II IMMI! MM IIIIIIM IIIIIIIIMI II Mill II MIIIIIM 

Db 649 AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 590

Qy 718 AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 777 

I I I llllllll IIIIIIIIIIIIIIMIIII MIIIIMIIIIII llllllll 

Db 589 ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 530

Qy 778 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGAC 837 

IIIMMMIMIIIIIIIMIIIMIIIMI MMI IIIIIIIMIIIIIIIII 

Db 529 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA--- 473

Qy 838 TACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAG 897 

llllllll IIIMMl MIIIIMIIIIIIIIIIIII 

Db 472 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAG 434 

Qy 898 AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 957 

IIMIIMIIIIIMIIIII IMIIIIIMIIII IIIMMl IIMIIIIMMII 

Db 433 AACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATC 374 

Qy 958 CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1017 

II M IIMMMMIIMMMII IIIMMl IMIMM MIIIIM MMM 

Db 373 CACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC 314 

Qy 1018 CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1077

II Mill IIMIIIIIIIIIIIMIII IIIIIIIIIIMII MIMMIMI III 

Db 313 CTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCG 254 

Qy 1078 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137 

lllllllllllllllllllllllllllllllllll IIIIIIIMIIMIIIIIIIIIII 
Db 253 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC 194

Qy 1138 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197

IIIIIIIIMIMMIIIMIMMIMMMMIMMM MIMMIIMIMIIM 

Db 193 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 134 



RESULT 9 
AL530081 

LOCUS       AL530081 859 bp mRNA linear EST 23-MAY-2003
DEFINITION  AL530081 Homo sapiens NEUROBLASTOMA COT 50-NORMALIZED Homo sapiens
            cDNA clone CS0DD009YM06 5-PRIME, mRNA sequence.
ACCESSION   AL530081
VERSION     AL530081.2 GI:31067916
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 859)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     On Feb 13, 2001 this sequence version replaced gi:12793574.
            Contact : Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            was normalized. Library was constructed by Life Technologies, a
            division of Invitrogen. This sequence belongs to sequence cluster
            9160.r For more information about this cluster, see
            http://www.genoscope.cns.fr/cgi-bin/cluster.cgi?seq=CS0DD009BG03QP1&cluster=9160.r.
            Contact : Feng Liang Email : fliang@lifetech.com URL :
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0DD009BG03QP1.
FEATURES             Location/Qualifiers
     source          1..859
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DD009YM06"
                     /tissue_type="NEUROBLASTOMA COT 50-NORMALIZED"
                     /clone_lib="Homo sapiens NEUROBLASTOMA COT 50-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo (dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  179 a 272 c 257 g 148 t 3 others
ORIGIN



Query Match 48.2%; Score 577.2; DB 9; Length 859;

Best Local Similarity 87.3%; Pred. No. 5.6e-121;

Matches 641; Conservative 2; Mismatches 90; Indels 1; Gaps 1;



Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60

III MM II IIIIMIIIII III MIMIIII IIIIIIMIIIIM II III

Db 126 ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 185

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120

MIMIIIIIMMIIIIIIIM MM! IMIMMMMII IMIIIMMMMI

Db 186 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 245

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180

MIMIMMIIIMMMMMMI MMIIMMI IMMIMIII Mill III

Db 246 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 305

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 240

IIIIIIIIIIIIIIMIIII MIIMIIIIIMIIIMII Mill M II II M

Db 306 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 365



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

IIMMM IMMIIIIIMII IIMIIII Mil 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M III 

366 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 425 
301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIII I Mil Mill 

426 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 485 
361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

IIIM M IIIMI I I MIIM MMIIM I IIMMM M IIIMI 

4 86 TATGAOUVGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAAC^ 54 5 

421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II MMIIM II Mill II MIIIMI MIMMIIIIIII IIIMMIIIM 

546 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 605 
481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

IIIIIIIMIMMIIMIIIIIIIMII MMIMMIIIIMII MMIIIMI 

6 06 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 665 
541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG-GCTGTCCTTTGATGT 599 

IIIIIIIIIIIIMIIIIIIIII Mill Mill II Mill I I II MMIIM 

666 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGTGTTATCTTTTGATGT 725 
600 CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 659 

IIIIIIIIMIIIIIIIIIMI Ml III III II II Mill MMIIM II 

726 CACCGGAGTTGTGCGGCAGTKGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 785 
660 TGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAA 719 

IMIIII MM MMIIM MIIMMIM M llhl M IMMIIIII 

786 CGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGRACATCAACGGGTTCAC 84 5 
720 TTCTGGCCGCCGGG 733 

I I IIMIIII I 

846 TACCGGCCGCCGAG 859 



RESULT 10
BE312000
LOCUS       BE312000 713 bp mRNA linear EST 26-OCT-2000
DEFINITION  601154768F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE:3510592 5',
            mRNA sequence.
ACCESSION   BE312000
VERSION     BE312000.1 GI:9130128
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 713)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Ling Hong/Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at: image.llnl.gov
            Plate: LLCM195 row: h column: 17
            High quality sequence start: 2
            High quality sequence stop: 713.
FEATURES             Location/Qualifiers
     source          1..713
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3510592"
                     /tissue_type="neuroblastoma"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_19"
                     /note="Organ: brain; Vector: pOTB7; Site_1: XhoI; Site_2:
                     EcoRI; cDNA made by oligo-dT priming. Directionally
                     cloned into EcoRI/XhoI sites using the following 5'
                     adaptor: GGCACGAG(G). Library constructed by Ling Hong
                     in the laboratory of Gerald M. Rubin (University of
                     California, Berkeley) using ZAP-cDNA synthesis kit
                     (Stratagene) and Superscript II RT (Life Technologies).
                     Note: this is a NIH_MGC Library."
BASE COUNT  157 a 226 c 211 g 119 t
ORIGIN
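The LOCUS line of each hit packs the accession, length, molecule type, topology, division and date into one whitespace-separated row. A minimal sketch of pulling those fields apart is shown below; the function name `parse_locus` and the regular expression are illustrative assumptions, not part of GenCore or any GenBank toolkit, and the pattern only covers the simple EST-style lines seen in this report.

```python
import re

# Hypothetical pattern for the short LOCUS lines in this report, e.g.
# "BE312000 713 bp mRNA linear EST 26-OCT-2000" (the "LOCUS" keyword is optional).
LOCUS_RE = re.compile(
    r"^(?:LOCUS\s+)?(?P<name>\S+)\s+(?P<length>\d+)\s+bp\s+"
    r"(?P<mol_type>\S+)\s+(?P<topology>linear|circular)\s+"
    r"(?P<division>\S+)\s+(?P<date>\d{2}-[A-Z]{3}-\d{4})\s*$"
)


def parse_locus(line):
    """Split a LOCUS line into named fields; raise ValueError if it does not fit."""
    match = LOCUS_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised LOCUS line: {line!r}")
    fields = match.groupdict()
    fields["length"] = int(fields["length"])  # sequence length in base pairs
    return fields


record = parse_locus("BE312000 713 bp mRNA linear EST 26-OCT-2000")
print(record["name"], record["length"], record["division"], record["date"])
```

The parsed length can then be compared with the `1..713` source feature and the BASE COUNT totals as a quick consistency check on the OCR'd record.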



Query Match 47.5%; Score 569; DB 10; Length 713;
Best Local Similarity 87.4%; Pred. No. 3.9e-119;
Matches 623; Conservative 0; Mismatches 90; Indels 0; Gaps 0;
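The "Best Local Similarity" figures in this report are consistent with the number of matching columns divided by the total number of alignment columns (matches + conservative + mismatches + indels). The sketch below reproduces the reported percentages for Results 9, 10 and 15 under that assumption; the function name is hypothetical and the formula is inferred from the numbers, not taken from GenCore documentation.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of alignment columns that are exact matches (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts as printed in the report for Results 9, 10 and 15.
for label, counts in [
    ("RESULT 9", (641, 2, 90, 1)),
    ("RESULT 10", (623, 0, 90, 0)),
    ("RESULT 15", (647, 7, 86, 30)),
]:
    print(label, round(best_local_similarity(*counts), 1))
```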



Qy 



Db 



125 AGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCC 184 

IIIMIIIIIMIMIMINI IIIIIIIMM IIIIMIMM IMM IIIIMI 

1 AGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCC 60 



Qy 

Db 



185 CCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACA 244 

llllllllllllllll IMIMIIMIIIMMIII mil II II II II INI 

6 1 CCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACA 120 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 



24 5 ACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGG 304 

III! Illlllllllllll llllllll III! Illllllllllllllll IIMI I 

121 ACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCG 180 

3 05 ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATG 364 

IIMIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII lllllllllllll 
181 ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATG 24 0 

365 ATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGG 424 

i M ill ill I I MINI IIMIIII IIIIIIMIIIIIIIIIIII 

241 ACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAG 300 

425 AAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGT 484 

lllllll II Mill M llllllll Illlllllllllll IIIIIMIIIIIIIII 
301 AAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGT 360 

485 TAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACC 544 
lllllllllllllllllllllllll Illllllllllllllll Illlllllllllll 



Db 



3 61 TAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACC 420 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



545 TCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCG 604 

IIIIIMIIIIIIIIIIM mil Mill II Mill! I II IIIIIIMIIIII 

421 TCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCG 48 0 

605 GAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCC 664 

Illlllllllllillll III III III II II Mill lllllill II MM 
481 GAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCC 540 

665 ACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTG 724 

III MM llllllll lllllllllll II Mill II Mllllllll I I I 

541 ACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCG 600 
725 GCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGG 784 

MIIMI IIMMIMMIIIIMIM IMIMMMIIM IMMIM IMMII 

601 GCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGG 660 

78 5 CCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGAC 837 

IIMIIIIIIIIIIIIIIIIIIIII Mill lllllll I I MM I 
661 CCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCCGGCACCGCGAGCC 713 



RESULT 11
BE260971
LOCUS       BE260971 717 bp mRNA linear EST 26-OCT-2000
DEFINITION  601153715F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE:3509931 5',
            mRNA sequence.
ACCESSION   BE260971
VERSION     BE260971.1 GI:9132709
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 717)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Ling Hong/Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at: image.llnl.gov
            Plate: LLCM193 row: m column: 04
            High quality sequence stop: 713.
FEATURES             Location/Qualifiers
     source          1..717
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3509931"
                     /tissue_type="neuroblastoma"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_19"
                     /note="Organ: brain; Vector: pOTB7; Site_1: XhoI; Site_2:
                     EcoRI; cDNA made by oligo-dT priming. Directionally
                     cloned into EcoRI/XhoI sites using the following 5'
                     adaptor: GGCACGAG(G). Library constructed by Ling Hong
                     in the laboratory of Gerald M. Rubin (University of
                     California, Berkeley) using ZAP-cDNA synthesis kit
                     (Stratagene) and Superscript II RT (Life Technologies).
                     Note: this is a NIH_MGC Library."
BASE COUNT  158 a 226 c 213 g 120 t
ORIGIN



Query Match 46.9%; Score 561.2; DB 10; Length 717;
Best Local Similarity 88.1%; Pred. No. 2.3e-117;
Matches 622; Conservative 0; Mismatches 83; Indels 1; Gaps 1;



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 



126 GCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCC 185 

IIIIIIMIIIIIIIIIIIII lllllllllll IMIIIIIIII Mill llllllll 

1 GCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCC 60 
186 CCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAA 245 

IIIIIIIMII III llllllllllllllllllll IMM II II II M Mill 

61 CCCGAGCCAGG-GGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAA 119 
246 CAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGA 3 05 

III llllllllllllll IIIMMI MM lllllllllllllllll Mill II 

12 0 CAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGA 179 
306 CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGA 365 

IIIMIMIIIMMIIIIIIMMMMIMIMMMII I MM IIIIIMIM 

180 CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGA 239 
366 TAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGA 425 

II MUM I I llllll llllllll I IIIIIIM II IIIMMI II 

24 0 CAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGA 299 
426 AGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTT 485 

llllll II Mill M IIIIIIM IIIIIIIIMIIII IIIIMIMIIIIIIII 

3 00 AGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTT 359 
486 AAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCT 545 

IMMMIMIMMMIIMMI MMMIMIIMMM MIMMIM IMM 

360 AAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCT 419 
546 CAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGG 605 

IIIIIMIIIIIIIIIM Mill Mill II IMIM I M IIIIIIIIIIMM 

42 0 CAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGG 4 79 

606 AGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCA 665 

llllllllllllllll III III III II II Mill IIIMIII II Mill 

4 8 0 AGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCA 53 9 

666 CTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGG 725 

M MM IMMMI MMIIIMM M Mill M MIMMIM I I M 

54 0 CTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGG 599 
726 CCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGC 785 



Mini IIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIII IIIMIII IIIIIIII 

Db 600 CCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGC 659 

Qy 786 CACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGC 831 

Mllllllllllllllllllllll HIM lllllllllllllll 

Db 660 CACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGC 705 



RESULT 12
BQ675698
LOCUS       BQ675698 902 bp mRNA linear EST 15-JUL-2002
DEFINITION  AGENCOURT_8036532 NIH_MGC_102 Homo sapiens cDNA clone IMAGE:6211917
            5', mRNA sequence.
ACCESSION   BQ675698
VERSION     BQ675698.1 GI:21786532
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 902)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLCM2375 row: c column: 22
            High quality sequence stop: 599.
FEATURES             Location/Qualifiers
     source          1..902
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:6211917"
                     /tissue_type="epidermoid carcinoma, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_102"
                     /note="Organ: salivary gland; Vector: pOTB7; Site_1: XhoI;
                     Site_2: EcoRI; cDNA made by oligo-dT priming.
                     Directionally cloned into EcoRI/XhoI sites using the
                     following 5' adaptor: GGCACGAG(G). Library constructed
                     by Ling Hong in the laboratory of Gerald M. Rubin
                     (University of California, Berkeley) using ZAP-cDNA
                     synthesis kit (Stratagene) and Superscript II RT (Life
                     Technologies). Note: this is a NIH_MGC Library."
BASE COUNT  194 a 292 c 255 g 161 t
ORIGIN



Query Match 46.4%; Score 556; DB 13; Length 902;
Best Local Similarity 87.7%; Pred. No. 3.8e-116;
Matches 607; Conservative 0; Mismatches 85; Indels 0; Gaps 0;



Qy  146 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 205
        | ||||||||||| ||||||||||| ||||| ||||||||||||||||||||||| ||||
Db    1 TCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCCCCCGAGCCAGGGGGAGGTGC 60

Qy  206 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 265
        |||||||||||||||| ||||| || || || || |||||||| |||||||||||||| |
Db   61 CGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAACAGCACCCGCGACCGGGTGG 120

Qy  266 CCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCA 325
        ||||||| ||||  ||||||||||||||||| ||||| ||||||||||||||||||||||
Db  121 CCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCA 180

Qy  326 CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCC 385
        ||||||||||||||||||||| |  |||| |||||||||| || ||||||   |   | |
Db  181 CCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACAC 240

Qy  386 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 445
        ||||| |||||||| | |||||||| || |||||||| |||||||| || ||||| || |
Db  241 ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 300

Qy  446 TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 505
        ||||||| |||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  301 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 360

Qy  506 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 565
        |||| |||||||||||||||||  |||||||||| ||||||||||||||||||||||| |
Db  361 AGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 420

Qy  566 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 625
        |||| ||||| || |||||| | || |||||||||||||||||||||||||||||| |||
Db  421 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 480

Qy  626 CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAG 685
         |||  ||| ||  || ||||| |||||||| || |||||||  |||| ||||||||  |
Db  481 GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 540

Qy  686 ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 745
        |||||||||| || ||||| || |||||||||| | | |||||||| |||||||||||||
Db  541 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 600

Qy  746 CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 805
        ||||||| |||||||||||||| |||||||| ||||||||||||||||||||||||||||
Db  601 CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 660

Qy  806 AGCACCTGCACAGCTCCCGGCACCGCCGAGAC 837
        |||| ||||  ||||||||||||||||||| |
Db  661 AGCATCTGCCAAGCTCCCGGCACCGCCGAGCC 692





RESULT 13
BI818841
LOCUS       BI818841 925 bp mRNA linear EST 04-OCT-2001
DEFINITION  603037307F1 NIH_MGC_115 Homo sapiens cDNA clone IMAGE:5178433 5',
            mRNA sequence.
ACCESSION   BI818841
VERSION     BI818841.1 GI:15930391
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 925)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM11444 row: n column: 02
            High quality sequence stop: 874.
FEATURES             Location/Qualifiers
     source          1..925
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5178433"
                     /lab_host="DH10B"
                     /clone_lib="NIH_MGC_115"
                     /note="Organ: pooled brain, lung, testis; Vector:
                     pCMV-SPORT6; Site_1: NotI; Site_2: EcoRV (destroyed); RNA
                     source anonymous pool of 6 male brains, age range 23-27; 1
                     male lung, age 27; and 1 male testis, age 69. Library is
                     oligo-dT primed and directionally cloned (EcoRV site is
                     destroyed upon cloning). Average insert size 1.8 kb,
                     insert size range 1-3 kb. Library is normalized and
                     enriched for full-length clones and was constructed by C.
                     Gruber (Invitrogen). Research Genetics tracking code
                     021. Note: this is a NIH_MGC Library."
BASE COUNT  170 a 316 c 278 g 161 t
ORIGIN



Query Match 44.6%; Score 534.2; DB 12; Length 925;
Best Local Similarity 87.6%; Pred. No. 3.5e-111;
Matches 607; Conservative 0; Mismatches 83; Indels 3; Gaps 2;



Qy 



UD 



1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III MM II lllllllllll III II III I MM I Ml III MM III II Ml 

151 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 210 



Qy 
Db 

Qy 

Db 



61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

IIIIIIIIMIIIIIIIIIIIII Mill IMMMMMIM IIIIMMIIMMI 

211 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 270 
121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

IIIIIIIMIIIMIIMIIIIIIII IIIMIIIIII IIIIMIIIM Mill III 

271 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 330 



Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

llllllllllllllllllll IIIIIIIIIIMIIMIIM Mill II II II II 

Db 331 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 390 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

IMMIM IIIMIIIIIMM IIMIIM MM IMMMIMMMMI Ml 

Db 3 91 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 45 0 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II IMIIIIIIMMMIIIIIIIIIIIIIIIIIIIIIIIIIMI I MM IIIM 

Db 4 51 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 510 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

IIIM II Mill! I I llllll IMMIM I IIIIMII M IMMI 

Db 511 TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 57 0 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 48 0 

II IIIIMII M Mill M IMMIM IIMMIIIIIIM MMMMMM 

Db 571 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 630 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

MMMMMMMMMMMMMMI IMIIIIIIIMIIIII Mllllllll 

Db 631 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 6 90 

Qy 541 TACCTCAGCAACCGGCTGC-TGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 599 

IMIIIMIIIIIIIIIII MM Mill Mill II llllll I M IMMIM 

Db 691 TACCTCAGCAACCGGCTGCTTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGT 750 

Qy 600 CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGA--GAGGCTATAGAGGGTTTTCGCCTC 657 

MMMMMMMMMMM III III II II II Mill IMMIM 

Db 751 CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGACGGGGAACATTGAGGGCTTTCGCCTT 810 

Qy 658 AGTGCCCACTCTTCCTCTGACAGCAAAGATAAC 690 

II MIIIM MM IMMIM MM I 

Db 811 AGCGCCCACTGCTCCTGTGACAGCAGGGATACC 843 



RESULT 14
BQ054305
LOCUS       BQ054305 1093 bp mRNA linear EST 29-MAR-2002
DEFINITION  AGENCOURT_6830409 NIH_MGC_106 Homo sapiens cDNA clone IMAGE:5936397
            5', mRNA sequence.
ACCESSION   BQ054305
VERSION     BQ054305.1 GI:19813645
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 1093)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Daniel McVicar, DBS/NCI
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLCM2125 row: k column: 22
            High quality sequence stop: 605.
FEATURES             Location/Qualifiers
     source          1..1093
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5936397"
                     /tissue_type="natural killer cells, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_106"
                     /note="Organ: blood; Vector: pOTB7; Site_1: XhoI; Site_2:
                     EcoRI; cDNA made by oligo-dT priming. Directionally cloned
                     into EcoRI/XhoI sites using the following 5' adaptor:
                     GGCACGAG(G). Library constructed by Ling Hong in the
                     laboratory of Gerald M. Rubin (University of California,
                     Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and
                     Superscript II RT (Life Technologies). Note: this is a
                     NIH_MGC Library."
BASE COUNT  233 a 347 c 309 g 203 t 1 others
ORIGIN



Query Match 43.5%; Score 521.2; DB 12; Length 1093;
Best Local Similarity 87.8%; Pred. No. 3.4e-108;
Matches 590; Conservative 0; Mismatches 58; Indels 24; Gaps 1;



Qy  525 CAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG 584
        |||  |||||||||| ||||||||||||||||||||||| ||||| ||||| || |||||
Db    1 CAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTG 60

Qy  585 GCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGA 644
        | | || |||||||||||||||||||||||||||||| ||| |||  ||| ||  || ||
Db   61 GTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA 120

Qy  645 GGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGA 704
        ||| |||||||| || |||||||  |||| ||||||||  ||||||||||| || |||||
Db  121 GGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGA 180

Qy  705 AATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCG 764
         || |||||||||| | | |||||||| |||||||||||||||||||| |||||||||||
Db  181 CATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCG 240

Qy  765 GCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCG 824
        ||| |||||||| |||||||||||||||||||||||||||||||| ||||| ||||||||
Db  241 GCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCG 300

Qy  825 GCACCGCCGAGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAG 884
        ||||||||||                        |||||||| |||||||| ||||||||
Db  301 GCACCGCCGA------------------------GCCCTGGACACCAACTATTGCTTCAG 336

Qy  885 CTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 944
        ||||||||||||||||||||||||||||||||| |||||||||||||| |||||||| ||
Db  337 CTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGG 396

Qy  945 CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 1004
        |||||||||||| || || |||||||||||||||||||| |||||||| |||||||| ||
Db  397 CTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCC 456

Qy 1005 CTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAA 1064
        |||||| |||||||| ||||| |||||||||||||||||||| |||||||||||||| ||
Db  457 CTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAA 516

Qy 1065 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 1124
        ||||||||| |||||||||||||||||||||||||||||||||||||| |||||||||||
Db  517 CCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGT 576

Qy 1125 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 1184
        |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db  577 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTG 636

Qy 1185 CAAGTGCAGCTG 1196
        ||||||||||||
Db  637 CAAGTGCAGCTG 648





RESULT 15
AL530080/C
LOCUS       AL530080 841 bp mRNA linear EST 23-MAY-2003
DEFINITION  AL530080 Homo sapiens NEUROBLASTOMA COT 50-NORMALIZED Homo sapiens
            cDNA clone CS0DD009YM06 3-PRIME, mRNA sequence.
ACCESSION   AL530080
VERSION     AL530080.2 GI:31067915
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 841)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished
COMMENT     On Feb 13, 2001 this sequence version replaced gi:12793573.
            Contact : Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr
            was normalized. Library was constructed by Life Technologies, a
            division of Invitrogen. This sequence belongs to sequence cluster
            9160.r For more information about this cluster, see
            http://www.genoscope.cns.fr/
            cgi-bin/cluster.cgi?seq=CS0DD009BG03NP1&cluster=9160.r. Contact :
            Feng Liang Email : fliang@lifetech.com URL :
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID : CS0DD009BG03NP1.
FEATURES             Location/Qualifiers
     source          1..841
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DD009YM06"
                     /tissue_type="NEUROBLASTOMA COT 50-NORMALIZED"
                     /clone_lib="Homo sapiens NEUROBLASTOMA COT 50-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo(dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."
BASE COUNT  150 a 240 c 283 g 153 t 15 others
ORIGIN
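The BASE COUNT line gives a quick integrity check on each record: the a/c/g/t counts plus any "others" (ambiguity codes such as N, M, Y, W, S, K, R) should add up to the length in the LOCUS line. For AL530080 that is 150 + 240 + 283 + 153 + 15 = 841. A minimal sketch of both directions of the check follows; the helper names are hypothetical.

```python
from collections import Counter


def base_count(sequence):
    """Tally a, c, g, t and lump every other symbol (e.g. IUPAC codes) into 'others'."""
    counts = Counter(sequence.lower())
    tally = {base: counts.get(base, 0) for base in "acgt"}
    tally["others"] = sum(n for sym, n in counts.items() if sym not in "acgt")
    return tally


def base_count_matches_length(a, c, g, t, others, length):
    """True when the BASE COUNT fields sum to the LOCUS length."""
    return a + c + g + t + others == length


# BASE COUNT values printed in this report for AL530080 and BQ054305.
print(base_count_matches_length(150, 240, 283, 153, 15, 841))
print(base_count_matches_length(233, 347, 309, 203, 1, 1093))
print(base_count("ACGTNacgtRY"))
```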



Query Match 43.5%; Score 520.2; DB 9; Length 841; 

Best Local Similarity 84.0%; Pred. No. 5.3e-108; 

Matches 647; Conservative 7; Mismatches 86; Indels 30; Gaps 5; 



Qy 

Db 


428 
841 


CGGTGCCGGAACCTGTATTGCTCT - CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTA 

nil II hill 1 llhlll 1 IIIIIIIIIIMI lllll llllllllllll 
CGGTACCTGMACCCTTGTTGYTCTCCCCGGGCAGAGCTGCTTCTGCTAAGGCTCAAGTTA 


486 
782 


Qy 


487 


AAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTC 


546 


Db 


781 


IMIIIIIIIIIIIMIIIIIMIIIIIIIIIMIIIMI IhllMIII IIIMI 

AAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATWCCTGGCGATACCTC 


722 


Qy 


547 


AGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGA 

iiiiiihii mill lllll :iiii II mill 1 II MM iiiiiiihi 

AGCAACCSGCSGCTGGCACCCAGCSACTCGCCAGAGTGGTTTTCTTTTGTTGTCACCGKA 


606 


Db 


721 


662 


Qy 


607 


GTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCAC 

lllllllllllllll III III III 11 II III Mllllll M Ml II 

GTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTAAGG--TTTCGCCTTAGCGCCNAC 


666 


Db 


661 


604 


Qy 


667 


TCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGC 
1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II III 1 II II 1 1 1 1 1 1 1 1 1 III 


726 


Db 


603 


Mil 1 M M M M 1 M M 1 1 M M Ml 1 M M M M 1 1 1 1 1 Ml 

GGCTCCTGTAACAGCAGGGATAACACACTGCAAGTGAACATCAACGGGTTAACTACCGGC 


544 


Qy 


727 


CGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCC 

lllll IIIIIMIIIIIIIIIIII IIIMIIIIIIIII Mllllll IIIIMIM 

CGCCGAGGTGACCTGGCCACCATTCTTGGCATGAACCGGCCTTTCCTGCTTCTCATGGCC 


786 


Db 


543 


484 


Qy 


787 


ACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGACTACAAGGAT 

IIIIIIIIIIIIIIIIIIMII lllll IIIIIIIIIIIMIIIII 
ACCCCGCTGGAGAGGGCCCAGCTTCTGCAAAGCTCCCGGCACCGCCGA 


846 


Db 


483 


436 


Qy 


847 


GACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGC 

llllllll IIMIIM IIIMMMIIIMIMMMIMMIMI 

GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGC 


906 


Db 


435 


388 


Qy 


907 


GTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCC 

mm Ml II mm iiiiiii mmii mm iiiiiii ii ii m 

GTGCGGCAGCTGTACATTAACTTCCGCAAGGACCTCGGCTGGGAGTGGATCCACGAGCCC 


966 


Db 


387 


328 


Qy 


967 


AAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACT 


1026 


Db 


327 


llllMhIIIIIIIII Mllllll MMMII IIIIMII llllllll Mill 

AAGGGCTWCCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACG 


268 


Qy 


1027 


CAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCG 


1086 



III illillllllllllll llllllllllllll IIIIIII II llllllllllll 



Db 267 CAGGACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGG--GCCTCGGCGGCGCCG 210 

Qy 1087 TGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCC 114 6 

Ml IMIMIMIIIMMMMII lIMM M M I M I M M 1 1 M 1 1 M 1 1 M M 

Db 2 09 TGC-GCGTGCCGCAGGCGCTGGAGCCGCTGCCCTTCGTGTACTACGTGGGCCGCAAGCCC 151 

Qy 114 7 AAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTG 1196 

lllllll II lllllllllllll lllllll lllllllll II Mil 
Db 150 AAGGTGGGGCGGCTGTCCAACATGGTCGTGCGCTCCTGCAAGGGCCGCTG 101 



Search completed: October 28, 2003, 00:08:04 
Job time : 2979.42 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd.



OM protein - protein search, using sw model 
Run on: October 21, 2003, 19:11:58 ; Search time 33.2026 Seconds
(without alignments)
1902.657 Million cell updates/sec
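The throughput figure can be cross-checked from the other run parameters. Assuming the usual definition of a cell update (one dynamic-programming cell per query residue per database residue), a 398-residue query against 158726573 database residues in 33.2026 seconds gives a rate that agrees with the printed 1902.657 million to within rounding of the search time. The function name below is illustrative only.

```python
def million_cell_updates_per_sec(query_length, db_residues, seconds):
    """Dynamic-programming cells filled per second, in millions (assumed definition)."""
    return query_length * db_residues / seconds / 1e6


# Query length, database size and search time as printed in the run header.
rate = million_cell_updates_per_sec(398, 158726573, 33.2026)
print(f"{rate:.3f} million cell updates/sec")
```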



Title:          US-10-017-372E-9
Perfect score:  2114
Sequence:       1 MAPSGLRLLPLLLPLLWLLV..........GRKPKVEQLSNMIVRSCKCS 398

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1107863 seqs, 158726573 residues

Total number of hits satisfying chosen parameters:      1107863

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
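The header describes the search as using the "sw model" with a gap-open penalty of 10.0 and a gap-extension penalty of 0.5, which reads as Smith-Waterman local alignment with affine gaps. The sketch below shows that recurrence (Gotoh's three-matrix form) so the scores are easier to interpret. It is not GenCore's implementation: the toy substitution function stands in for BLOSUM62, and the convention that a gap of length k costs gap_open + k * gap_ext is an assumption.

```python
def local_alignment_score(a, b, substitution, gap_open=10.0, gap_ext=0.5):
    """Best Smith-Waterman local score with affine gaps (Gotoh recurrence).

    Assumed gap cost: gap_open + k * gap_ext for a gap of length k.
    """
    neg_inf = float("-inf")
    rows, cols = len(a) + 1, len(b) + 1
    h = [[0.0] * cols for _ in range(rows)]      # best score ending at (i, j)
    e = [[neg_inf] * cols for _ in range(rows)]  # alignment ends with a gap in a
    f = [[neg_inf] * cols for _ in range(rows)]  # alignment ends with a gap in b
    best = 0.0
    for i in range(1, rows):
        for j in range(1, cols):
            e[i][j] = max(h[i][j - 1] - gap_open - gap_ext, e[i][j - 1] - gap_ext)
            f[i][j] = max(h[i - 1][j] - gap_open - gap_ext, f[i - 1][j] - gap_ext)
            diagonal = h[i - 1][j - 1] + substitution(a[i - 1], b[j - 1])
            h[i][j] = max(0.0, diagonal, e[i][j], f[i][j])  # 0.0 restarts the local alignment
            best = max(best, h[i][j])
    return best


def toy_substitution(x, y):
    """Hypothetical stand-in for a real scoring table such as BLOSUM62."""
    return 5.0 if x == y else -4.0


print(local_alignment_score("ACGT", "ACGT", toy_substitution))
print(local_alignment_score("AAAAAAAATTTTTTTT", "AAAAAAAAGTTTTTTTT",
                            toy_substitution, gap_open=2.0, gap_ext=0.5))
```

With a real substitution matrix, aligning the query against itself under this recurrence yields the maximum attainable score, which is the role the "Perfect score" plays in the header.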



Database :      A_Geneseq_19Jun03:*
                 1: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1980.DAT:
                 2: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1981.DAT:
                 3: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1982.DAT:
                 4: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1983.DAT:
                 5: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1984.DAT:
                 6: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1985.DAT:
                 7: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1986.DAT:
                 8: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1987.DAT:
                 9: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1988.DAT:
                10: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1989.DAT:
                11: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1990.DAT:
                12: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1991.DAT:
                13: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1992.DAT:
                14: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1993.DAT:
                15: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1994.DAT:
                16: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1995.DAT:
                17: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1996.DAT:
                18: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1997.DAT:
                19: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1998.DAT:
                20: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1999.DAT:
                21: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2000.DAT:
                22: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2001.DAT:
                23: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2002.DAT:
                24: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2003.DAT:



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
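The "% Query Match" column in the summary that follows is consistent with each hit's score expressed as a percentage of the perfect score (2114 for this query). That relationship is inferred from the printed numbers, not from GenCore documentation, and the function name below is hypothetical. It is useful for sanity-checking OCR-damaged scores in the table.

```python
PERFECT_SCORE = 2114  # "Perfect score" from the run parameters


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score (assumed definition of Query Match)."""
    return 100.0 * score / perfect_score


# Scores of the first three hits in the summary table.
for score in (2053, 1931, 1924):
    print(score, round(query_match_percent(score), 1))
```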

SUMMARIES 

                           %
Result                 Query
   No.         Score   Match  Length  DB  ID            Description

     1          2053    97.1     390  23  AAE13596      Porcine transformi
     2          1931    91.3     390  22  AAM39186      Human polypeptide
     3          1924    91.0     390   7  AAP61468      PreTGF-beta gene p
     4          1924    91.0     390  11  AAR04034      Sequence of pre-TG
     5          1924    91.0     390  11  AAR05258      Human pre-transfor
     6          1924    91.0     390  12  AAR13813      Human pro-TGF-beta
     7          1924    91.0     390  16  AAR73596      Human TGF-beta 1 p
     8          1924    91.0     390  17  AAR90827      Pre-transforming g
     9          1921    90.9     390  23  AAU77101      Human transforming
    10          1921    90.9     390  23  AArilDy43     Human transforming
    11        1920.5    90.8     391  24  Addo 2 / O U  TGFB1 Arg25Pro pol
    12          1919    90.8     390  13  AAR20124      Sequence of simian
    13          1915    90.6     390  15  AAR46227      Human pre-TGF-beta
    14        1913.5    90.5     391  16  AAR83054      Transforming growt
    15          1913    90.5     390  19  AAW / 8 / 0 b Human pre-transfor
    16          1912    90.4     390  22  AAB84501      Nucleotide sequenc
    17        1909.5    90.3     391   9  AAP81362      Human transforming
    18        1908.5    90.3    4 J4  11  AAR03 /43     Monkey transformin
    19          1905    90.1     390  24  ABB82781      TGFB1 Arg25Pro pol
    20          1888    89.3     386  11  AAR05663      Simian Transformin
    21        1882.5    89.0     387  11  AAK0bDb4      Simian Transformin
    22          1872    88.6     390  11  AAR05492      Chimeric simian TG
    23          1858    88.4     390  13  AAR27522      TGF-beta 1/beta 2
    24        1848.5    87.4     389  13  AAR29657      TGF-beta 1. Homo
    25      lo4o , b    87.2   4 b J  22  AAiyi4 uy /2  Human polypeptide
    26          1763    83.4     390  13  AAR20126      Sequence of hybrid
    27        L / OZ   O J ,     390  11  AAKU b /4 y   Human TGF-beta2 ex
    28         1 /bb    83.1     390  11  AAR05665      Human Transforming
    29          1754    83.0     390  11  AAR05566      Hybrid transformin
    30  1 / fi b . b    82.5     391  10  AAr y 1 y u u Sequence encoded b
    31        1718.5    81.3     389  16  AAR79921      Simian-human hybri
    32          1300    61.5     278  15  AAR53090      Polypeptide cross-
    33          1295    61.3     278  12  AAR12541      Latency associated
    34        1262.5    59.7     458  23  ABG31507      LAP-mIFNB construc
    35        1262.5    59.7     463  23  ABG31510      LAP-huIFNB constru
    36        1173.5    55.5     290  22  ABG06792      Novel human diagno
    37          1135    53.7     450  23  ABG31508      mIFNB-LAP construc
    38          1118    52.9     448  23  ABG31509      huIFNB-LAP constru
    39           944    44.7     227  22  ABG20234      Novel human diagno
    40         902.5    42.7     236  22  ABG20233      Novel human diagno
    41           873    41.3     382  21  AAB08338      Amino acid sequenc
    42           872 4i . , i    382  23  AAU77105      Frog transforming
    43           837    39.6     456  19  AAW78786      Pig transforming g
    44           835    39.5     412  16  AAR73598      Human TGF-beta 3 p
    45           832    39.4     412  13  AAR20621      Transforming Growt



ALIGNMENTS 



RESULT 1 

AAE13596 

ID AAE13596 standard; Protein; 390 AA . 
XX 

AC AAE13596; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-beta1) mutant -
XX
KW Porcine; transforming growth factor beta 1; TGF-beta1; gene therapy;

KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 

KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 

KW diabetes mellitus; sarcoidosis; psoriasis; dermatological; mutant; 

KW mutein. 
XX 

OS Sus scrofa.
XX
FH Key Location/Qualifiers
FT Misc-difference 223
FT /note= "Wild type Cys substituted with Ser"
FT Misc-difference 225
FT /note= "Wild type Cys substituted with Ser"
XX
PN WO200181404-A2.
XX
PD 01-NOV-2001.
XX 

PF 20-APR-2001; 2001WO-US12980 . 
XX 

PR 20-APR-2000; 2000US-199014P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR N-PSDB; AAD22696. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Example 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a
CC gene encoding a regulatory transcription factor under the control of a
CC promoter encoding a transforming growth factor-beta (TGF-beta). The
CC vector is useful for expressing TGF-beta, such as TGF-beta1, TGF-beta2
CC or TGF-beta3, its variants or homologues, by transfecting a cell which
CC is part of a host suspected of having an autoimmune disease, especially
CC inflammatory bowel disease (IBD), under conditions such that the
CC polypeptide encoded by the nucleic acid sequence in the vector is
CC expressed. The vector is delivered using a delivery system. The delivery
CC of the vector results in substantial elimination of symptoms of the
CC autoimmune disease and increased production of IL-10 by the host. The
CC composition is useful for treating various diseases with an autoimmune
CC component such as multiple sclerosis, rheumatoid arthritis, systemic
CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis
CC and psoriasis, and also for assaying the expression of a gene in a cell.
CC The vector is further useful for screening of the effect of test
CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells.
CC The present sequence is porcine TGF-beta1 mutant.
XX
SQ Sequence 390 AA;



Query Match 97.1%; Score 2053; DB 23; Length 390;
Best Local Similarity 98.0%; Pred. No. 1.3e-174;
Matches 390; Conservative 0; Mismatches 0; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              ||||||||||||||||||||||||||||||||||||||        ||||||||||||||
Db        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390
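
The figures printed with each alignment appear to be related in a simple way: in the results listed here, Best Local Similarity equals Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). This relationship is inferred from the reported numbers and is not stated by the search program itself. The sketch below, in Python, checks it against the counts above; the function names are hypothetical and are not part of any search tool.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent identity over all alignment columns, to one decimal place.

    Assumption: similarity = matches / (matches + conservative +
    mismatches + indels). This is inferred from the printed figures.
    """
    columns = matches + conservative + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return round(100.0 * matches / columns, 1)


def count_columns(qy_row: str, db_row: str, gap: str = "-") -> tuple:
    """Count (identical, non-identical, gap) columns for one aligned row pair.

    Conservative substitutions cannot be told apart from other mismatches
    without a scoring matrix, so both are counted as non-identical here.
    """
    if len(qy_row) != len(db_row):
        raise ValueError("aligned rows must have equal length")
    identical = non_identical = gaps = 0
    for q, d in zip(qy_row, db_row):
        if q == gap or d == gap:
            gaps += 1
        elif q == d:
            identical += 1
        else:
            non_identical += 1
    return identical, non_identical, gaps


# Counts reported for Result 1 (390 matches, 8 indel columns).
print(best_local_similarity(390, 0, 0, 8))

# Row 241-300 of Result 1, with the 8-residue gap written as dashes.
qy = "SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN"
db = "SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN"
print(count_columns(qy, db))
```

The same function applied to counts of 365 matches, 10 conservative, 15 mismatches and 8 indels gives 91.7, and 366, 10, 14 and 8 gives 92.0.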



RESULT 2 

AAM39186 

ID AAM39186 Standard; Protein; 390 AA.
XX
AC AAM39186;
XX
DT 22-OCT-2001 (first entry)
XX
DE Human polypeptide SEQ ID NO 2331.
XX
KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer;
KW peripheral nervous system; neuropathy; central nervous system; CNS;
KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic;
KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic;
KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation;
KW leukaemia.
XX
OS Homo sapiens.
XX
PN WO200153312-A1.
XX
PD 26-JUL-2001.
XX
PF 26-DEC-2000; 2000WO-US34263.
XX
PR 21-JAN-2000; 2000US-0488725.
PR 25-APR-2000; 2000US-0552317.
PR 09-JUL-2000; 2000US-0598042.
PR 19-JUL-2000; 2000US-0620312.
PR 03-AUG-2000; 2000US-0653450.
PR 14-SEP-2000; 2000US-0662191.
PR 19-OCT-2000; 2000US-0693036.
PR 29-NOV-2000; 2000US-0727344.
XX
PA (HYSE-) HYSEQ INC.
XX
PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D;
PI Wang J, Wang Wehrman T, Xu C, Xue AJ, Yang Y, Zhang J;
PI Zhao QA, Zhou P, Goodrich R, Drmanac RT;
XX
DR WPI; 2001-442253/47.
DR N-PSDB; AAI58342.
XX
PT Novel nucleic acids and polypeptides, useful for treating disorders
PT such as central nervous system injuries -
XX
PS Example 4; SEQ ID NO 2331; 10078pp; English.
XX
CC The invention relates to human nucleic acids (AAI57798-AAI61369) and
CC the encoded polypeptides (AAM38642-AAM42213) with nootropic,
CC immunosuppressant and cytostatic activity. The polynucleotides are useful
CC in gene therapy. A composition containing a polypeptide or polynucleotide
CC of the invention may be used to treat diseases of the peripheral nervous
CC system, such as peripheral nervous injuries, peripheral neuropathy and
CC localised neuropathies and central nervous system diseases, such as
CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic
CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the
CC utilisation of the activities such as: Immune system suppression,
CC Activin/inhibin activity, chemotactic/chemokinetic activity, haemostatic
CC and thrombolytic activity, cancer diagnosis and therapy, drug screening,
CC assays for receptor activity, arthritis and inflammation, leukaemias and
CC C.N.S disorders.
CC Note: The sequence data for this patent did not form part of the printed
CC specification.
XX
SQ Sequence 390 AA;

Query Match 91.3%; Score 1931; DB 22; Length 390;
Best Local Similarity 92.0%; Pred. No. 1e-163;
Matches 366; Conservative 10; Mismatches 14; Indels 8; Gaps 1;



Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292



RESULT 3
AAP61468
ID AAP61468 Standard; Protein; 390 AA.
XX
AC AAP61468;
XX
DT 31-OCT-2002 (updated)
DT 28-OCT-1991 (first entry)
XX
DE PreTGF-beta gene product.
XX
KW Transforming growth factor beta; cancer; wound healing.
XX
OS Unidentified.
XX
FH Key Location/Qualifiers
FT Protein 279..390
XX
PN EP200341-A.
XX
PD 10-DEC-1986.
XX
PF 21-MAR-1986; 86EP-0302112.
XX
PR 22-MAR-1985; 85US-0715142.
PR 13-MAR-1987; 87US-0025423.
XX
PA (GETH ) GENENTECH INC.
XX
PI Derynck RMA;
XX
DR WPI; 1986-326875/50.
DR N-PSDB; AAN60972.
XX
PT TGF-beta prodn. from transformed hosts - useful esp. for treating
PT wounds (J6 2/9/86).
XX
PS Disclosure; Fig 1b; 26pp; English.
XX
CC The gene product is known to stimulate cell proliferation and
CC inhibit anchorage-dependent growth of a variety of human cancer cell
CC lines, it is esp. useful in treatment of burns and the promotion of
CC surface and internal wound healing. TGF-beta may be expressed from a
CC transformed CHO cell line.
CC (Updated on 31-OCT-2002 to add missing OS field.)
XX
SQ Sequence 390 AA;



Query Match 91.0%; Score 1924; DB 7; Length 390;
Best Local Similarity 91.7%; Pred. No. 4.4e-163;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db         61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db        121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390





RESULT 4 
AAR04034 

ID AAR04034 standard; protein; 390 AA. 
XX 

AC AAR04034; 



XX 

DT 25-MAR-2003 (updated) 

DT 31-OCT-2002 (updated) 

DT 31-MAY-1989 (first entry) 
XX 

DE Sequence of pre-TGF-beta 1. 
XX 

KW Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth 

KW inhibition. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 348..500

XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 88WO-US01945 . 
XX 

PR 08-JUN-1988; 88WO-US01945 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 

DR N-PSDB; AAQ02815. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta-3 used as a 

PT probe, or to produce TGF beta 3, for inhibiting growth of certain normal 

PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. 2; 61pp; English. 
XX 

CC Sequence is an exon of transforming growth factor-beta 1 (pre-TGF-beta

CC 1) polypeptide and corresponds to AA's 288-338 of mature TGF-beta 1. The 

CC nucleic acid encoding second subtype of TGF-beta (TGF-beta 3) is useful 

CC as a probe or to produce TGF-beta 3 for inhibition of normal and 

CC neoplastic cell growth. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 390 AA; 

Query Match 91.0%; Score 1924; DB 11; Length 390;
Best Local Similarity 91.7%; Pred. No. 4.4e-163;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db         61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db        121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 5 




AAR05258 




ID AAR05258 Standard; protein; 390 AA.
XX
AC AAR05258;
XX
DT 25-MAR-2003 (updated)
DT 05-AUG-1990 (first entry)
XX
DE Human pre-transforming growth factor-beta-1 (pre-TGF-beta-1)
XX
KW Transforming growth factor-beta-1 (TGF-beta-1);
KW neoplastic cell line inhibition;
KW EGF-potentiated anchorage-independent growth.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT Peptide 1..278
FT Protein 279..2011
FT Domain 8..23
FT /note="hydrophobic domain"
FT Modified-site 82..84
FT /note="potential N-glycosylation site"
FT Modified-site 136..138
FT /note="as above"
FT Modified-site 176..178
FT /note="as above"
FT Cleavage-site 277..278
FT /note="proteolytic cleavage site"
XX
PN US4886747-A.
XX
PD 12-DEC-1989.
XX
PF 13-MAR-1987; 87US-0025423.
XX
PR 13-MAR-1987; 87US-0025423.
PR 22-MAR-1985; 85US-0715142.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR N-PSDB; AAQ93301. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig 1b; 28pp; English.
XX
CC Pre-TGF-beta-1 is a polypeptide of 390 amino acids. Post-translational
CC cleavage of the precursor gives rise to the mature TGF-beta monomer.
CC The sequence for human TGF-beta was determined by direct amino acid
CC sequence analysis and by deduction from the TGF-beta cDNA. It is
CC capable of inducing EGF-potentiated anchorage-independent growth of
CC target cell lines, and/or growth inhibition of neoplastic cell lines. It
CC can be used for treating wounds, eg burns or epidermal ulcers.

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 390 AA; 

Query Match 91.0%; Score 1924; DB 11; Length 390;
Best Local Similarity 91.7%; Pred. No. 4.4e-163;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db         61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db        121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 6 
AAR13813 

ID AAR13813 standard; Protein; 390 AA. 
XX 

AC AAR13813; 
XX 

DT 20-NOV-1991 (first entry) 
XX 

DE Human pro-TGF-beta 1. 
XX 

KW Osteogenetic; tumoricidal. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Peptide 1..29
FT /note= "signal peptide"
FT Peptide 30..390
FT /note= "pro-TGF-beta 1"
FT Peptide 279..390
FT /note= "TGF-beta 1"

XX 

PN JP03180192-A. 
XX 

PD 06-AUG-1991. 
XX 

PF 07-DEC-1989; 89JP-0318243 . 
XX 

PR 07-DEC-1989; 89JP-0318243 . 
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 
DR N-PSDB; AAQ13392.
XX 

PT Human pro-TGF-beta 1 prodn., for osteo-genetic activity - by 
PT preparing DNA chain contg. base sequence coding for human 
PT pre: pro-TGF-beta 1, forming expression vector etc. 
XX 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The amino acid sequence codes for human prepro-TGF-beta 1 which 
CC can be produced by recombinant methods, it has osteogenetic and 
CC tumoricidal activity. 
XX 

SQ Sequence 390 AA;

Query Match 91.0%; Score 1924; DB 12; Length 390;
Best Local Similarity 91.7%; Pred. No. 4.4e-163;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db         61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db        121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 7 




AAR73596 




ID AAR73596 standard; Protein;
XX
AC AAR73596;
XX
DT 25-MAR-2003 (updated)
DT 20-DEC-1995 (first entry)
XX
DE Human TGF-beta 1 protein.
XX
KW Transforming growth factor-t
KW TGF-beta 3; osteogenic cell
KW bone-inducing cofactor.
XX
OS Homo sapiens.
XX
PN US5409896-A.
XX
PD 25-APR-1995.
XX
PF 12-NOV-1993; 93US-0132405.
XX
PR 01-SEP-1989; 89US-0401906.
PR 12-NOV-1991; 91US-0790856.
PR 18-MAY-1993; 93US-0063841.
PR 12-NOV-1993; 93US-0132405.
XX

PA (GETH ) GENENTECH INC. 
XX 

PI Ammann AJ, Rudman CG; 
XX 

DR WPI; 1995-169610/22. 
XX 



PT Compsn. for treating skeletal tissue deficiency - comprising 

PT transforming growth factor-beta and an osteogenic cell source in a 

PT carrier 

XX 

PS Claim 3; Column 15-18; 19pp; English. 
XX 

CC This sequence represents human transforming growth factor-beta 1
CC (TGF-beta 1). This sequence and the sequence for human TGF-beta 3 (see
CC AAR73598) are claimed within the scope of the invention. The invention
CC is a composition consisting of a TGF-beta protein and an osteogenic cell
CC source (OCS) formulated in an acceptable carrier other than a bone
CC morphogenic cofactor. This composition can be used for the restoration
CC of bone deficiency. This provides for the generation of mature bone
CC only where it is required, without the inclusion of a specific
CC bone-inducing cofactor. This method can be used with any of the 5 human
CC TGF-beta's or with TGF-beta from other species.

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 390 AA; 

Query Match 91.0%; Score 1924; DB 16; Length 390;
Best Local Similarity 91.7%; Pred. No. 4.4e-163;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db         61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db        121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 8 
AAR90827 

ID AAR90827 standard; Protein; 390 AA. 
XX 

AC AAR90827; 
XX 

DT 25-MAR-2003 (updated) 

DT 25-JAN-1980 (first entry) 

XX 

DE Pre-transforming growth factor beta 1. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Domain 8..23
FT /note= "hydrophobic domain"
FT Modified-site 82..84
FT /note= "potential N-glycosylation site"
FT Modified-site 136..138
FT /note= "potential N-glycosylation site"
FT Modified-site 176..178
FT /note= "potential N-glycosylation site"
FT Cleavage-site 277..279
FT /note= "trypsin-like peptidase cleavage site"
FT Protein 279..390
FT /label= mature_TGF_beta_1

XX 

PN US5482851-A. 

XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US-0147364.
XX
PR 13-MAR-1987; 87US-0025423.
PR 22-MAR-1985; 85US-0715142.
PR 04-AUG-1989; 89US-0389929.
PR 04-MAR-1992; 92US-0845893.
PR 05-NOV-1993; 93US-0147364.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1996-076891/08. 
DR N-PSDB; AAT15720. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 
PT using Chinese hamster ovary cells, for use in diagnostic applications 
PT or in therapy 
XX 

PS Example 3; Fig 1A-C; 26pp; English.
XX 



CC The pre-transforming growth factor (TGF) beta 1 protein is encoded 

CC by AAT15720. The mature TGF beta 1 monomer is cleaved from the 

CC precursor at the Arg-Arg dipeptide immediately preceding the mature 

CC TGF-beta 1 NH2-terminus. It does not contain a recognisable N-terminal

CC signal peptide typical of most secreted proteins. The pre-TGF beta 1 

CC contains several pairs of basic residues which could undergo 

CC post-translational cleavage and give rise to separate polypeptide 

CC entities. The precursor contains 3 potential N-glycosylation sites, none 

CC of which are localised in the mature TGF beta 1. This is useful in 

CC purification of the mature protein. TGF beta 1 can be used in, e.g. wound 

CC healing. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA;

Query Match 91.0%; Score 1924; DB 17; Length 390;
Best Local Similarity 91.7%; Pred. No. 4.4e-163;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy          1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
              | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db          1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy         61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
              ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db         61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy        121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
              ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db        121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
              ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db        181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy        241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
              :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db        241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy        301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy        361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
              ||||||||||||||||||||||||||||||||||||||
Db        353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 9 
AAU77101 

ID AAU77101 standard; Protein; 390 AA. 
XX 

AC AAU77101; 
XX 

DT 05-JUN-2002 (first entry) 
XX 

DE Human transforming growth factor beta 1 (TFG-beta-1) polypeptide. 



XX 

KW Human; transforming growth factor beta; TGF-beta; insulin production; 

KW type I diabetes mellitus; pancreatic cell outgrowth; wound healing; 

KW pancreatic duct tissue; ischaemia; stroke; nervous system aging; 

KW neurological condition; neurodegenerative disease; inflammation; 

KW vasal injury; chemical injury; traumatic injury; tumour-induced injury;

KW amyotrophic lateral sclerosis; spinocerebellar degeneration; 

KW immunological disease; multiple sclerosis; TGF-beta-1. 

XX 

OS Homo sapiens. 
XX 

PN WO200212336-A2.
XX
PD 14-FEB-2002.
XX
PF 09-FEB-2001; 2001WO-US04192.
XX
PR 09-AUG-2000; 2000US-0635368.
XX 

PA (CURI-) CURIS INC. 
XX 

PI Wang M, Pang K; 
XX 

DR WPI; 2002-257468/30. 
XX 

PT Treating a subject with a disorder resulting from insufficient insulin 

PT production, and inducing outgrowth of pancreatic cells, involves using 

PT a transforming growth factor beta therapeutic - 
XX 

PS Disclosure; Fig 1; 77pp; English. 
XX 

CC The invention relates to treating a subject with a disorder resulting 

CC from insufficient insulin production, involving contacting the subject 

CC with a transforming growth factor beta (TGF-beta) therapeutic. TGF-beta 

CC polypeptides can be used for treating a subject with a disorder resulting 

CC from insufficient insulin production, e.g. type I diabetes mellitus, and 

CC for inducing outgrowth of pancreatic cells associated with pancreatic 

CC duct tissue within a subject. A composition comprising a TGF-beta protein 

CC may be useful in wound healing and treatment of neurological conditions 

CC derived from acute, subacute or chronic injury to the nervous system, 

CC including traumatic injury, chemical injury, vasal injury and deficits 

CC (such as ischaemia resulting from stroke), together with

CC infectious/inflammatory and tumour-induced injury, aging of the nervous

CC system including Alzheimer's disease, chronic neurodegenerative diseases 

CC including Parkinson's disease, Huntington's chorea, amyotrophic lateral 

CC sclerosis, spinocerebellar degenerations and chronic immunological 

CC diseases of the nervous system or affecting the nervous system, including 

CC multiple sclerosis. This sequence represents the human TGF-beta-1 

CC protein. 

XX 

SQ Sequence 390 AA;



Query Match 90.9%; Score 1921; DB 23; Length 390; 

Best Local Similarity 91.7%; Pred. No. 8.1e-163; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 



Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121
Db 121

Qy 181
Db 181

Qy 241
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYN

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353



RESULT 10 
AAE16943 

ID AAE16943 standard; Protein; 390 AA. 
XX 

AC AAE16943; 
XX 

DT 18-APR-2002 (first entry) 
XX 

DE Human transforming growth factor-beta1 (TGF-beta1) protein.
XX

KW Human; transforming growth factor-beta1; TGF-beta1; osteoporosis;
KW latency associated peptide; LAP; integrin alphavbeta3; apoptosis;
KW immunomodulation; inflammatory disease; fibrotic disease; cancer;
KW diabetic retinopathy; chronic obstructive pulmonary disorder;
KW bone resorption; rheumatoid arthritis; psoriasis; restenosis;
KW atherosclerosis; liver fibrosis; asthma; cytostatic; osteopathic;
KW ophthalmological; antiarteriosclerotic; vasotropic.
XX

OS Homo sapiens.



XX 

FH Key Location/Qualifiers
FT Peptide 1..29
FT /label= Signal_peptide
FT Region 30..278
FT /note= "LAP-beta1"
FT Domain 244..246
FT /note= "RGD motif"
FT Protein 279..390
FT /note= "Human mature TGF-beta1 protein"



XX 

PN WO200190186-A1. 
XX 

PD 29-NOV-2001. 
XX 

PF 25-MAY-2001; 2001WO-GB02352.
XX

PR 26-MAY-2000; 2000GB-0012991.
PR 05-JAN-2001; 2001GB-0000286.
XX 

PA (GLAX ) GLAXO GROUP LTD. 
XX 

PI Ludbrook S, Barry S, Morgan C, Miller D; 
XX 

DR WPI; 2002-097645/13. 
XX 

PT Identifying modulators of interactions between latency associated 

PT peptides and integrin alphavbeta3 for therapeutics, by contacting the

PT peptide and integrin with a test product and determining if the product 

PT modulates interaction - 
XX 

PS Disclosure; Page 37-39; 44pp; English. 
XX 

CC The invention relates to a method for identification of a modulator
CC of the interaction between latency associated peptide (LAP) of
CC transforming growth factor-beta1 (TGF-beta1) and integrin alphavbeta3.
CC The method is useful for identifying a modulator of the interaction
CC between LAP and integrin alphavbeta3. The method is useful for
CC immunomodulation, in the treatment of inflammatory disease, fibrotic
CC disease, cancer, diabetic retinopathy, bone resorption or osteoporosis,
CC and for preventing apoptosis by administering the modulator to the host.
CC The modulator (inhibitor of the interaction between LAP-beta1 and
CC integrin alphavbeta3) is useful in the manufacture of a medicament for
CC immunomodulation. The modulator (activator of the interaction between
CC LAP-beta1 and integrin alphavbeta3) is useful in the manufacture of a
CC medicament for preventing apoptosis. The modulator is useful for
CC treating an inflammatory or fibrotic disease such as chronic obstructive
CC pulmonary disorder, rheumatoid arthritis, psoriasis, restenosis,
CC atherosclerosis, liver fibrosis and asthma. The present sequence is
CC human TGF-beta1 protein.

XX 

SQ Sequence 390 AA; 

Query Match 90.9%; Score 1921; DB 23; Length 390; 
Best Local Similarity 91.7%; Pred. No. 8.1e-163; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
Db

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 11 
ABB82780 

ID ABB82780 Standard; Protein; 391 AA. 
XX 

AC ABB82780;
XX 

DT 18-MAR-2003 (first entry) 
XX 

DE TGFB1 Arg25Pro polymorphism G-allele protein sequence.
XX

KW Polymorphism; transforming growth factor beta 1; TGFB1; TGFbeta-1;
KW renal failure; nephrotropic; human; allele.

XX 

OS Homo sapiens. 
XX 

PN WO200290585-A2.
XX

PD 14-NOV-2002.
XX

PF 08-MAY-2002; 2002WO-GB02066.
XX

PR 09-MAY-2001; 2001GB-0011277.
XX 

PA (UYSH-) UNIV SHEFFIELD HALLAM. 
XX 

PI El-Nahas AM, Blakemore A, Khalil MS; 
XX 

DR WPI; 2003-120560/11. 
DR N-PSDB; ABV75391. 
XX 

PT Determining an individual's susceptibility to the progression of renal
PT failure comprises detecting the presence of a genetic polymorphism
PT pattern in transforming growth factor beta 1 (TGFB1) gene in a sample
PT from the individual
XX 

PS Claim 51; Page 59-61; 62pp; English. 
XX 

CC The invention relates to determining an individual's susceptibility to
CC renal failure and involves detecting the presence of a genetic
CC polymorphism pattern in transforming growth factor beta 1 (TGFB1) gene in
CC a sample from an individual, where polymorphism pattern is associated
CC with renal failure. The method is useful for determining an individual's
CC susceptibility to the progression of renal failure. The nucleic acid
CC comprising a T(-509)C polymorphism of TGFB1 gene, or a polypeptide
CC comprising a sequence of 391 amino acids is useful for preparing a
CC medicament for retarding or preventing the progression of renal disease,
CC and for drug research purposes for retarding or preventing the
CC progression of renal disease. Sequences ABV75386-88 represents the
CC protein sequence for the TGFB1 G-allele of the Arg25Pro polymorphism
CC of exon 1.

XX 

SQ Sequence 391 AA; 

Query Match 90.8%; Score 1920.5; DB 24; Length 391;
Best Local Similarity 91.7%; Pred. No. 9e-163;
Matches 366; Conservative 10; Mismatches 14; Indels 9; Gaps 2;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179
       ||||| : ||:|| |||||||||||||||||||||||| ||||||||||||||||||:||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 180

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 239
       |||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 240

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEK 299
        :|||||||||||||||||||||||||||||| ||||||        |||||||||||||
Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEK 292

Qy 300 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 352

Qy 360 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       |||||||||||||||||||||||||||||||||||||||
Db 353 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 391



RESULT 12 
AAR20124 

ID AAR20124 standard; Protein; 390 AA. 
XX 

AC AAR20124; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 



DE Sequence of simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator. 
XX 

OS Monkey.
XX

FH Key Location/Qualifiers
FT Peptide 8..21
FT Protein 279..390
XX

PN WO9119513-A.
XX

PD 26-DEC-1991.
XX

PF 20-JUN-1991; 91WO-US04449.
XX

PR 20-JUN-1990; 90US-0541221.
XX

PA (BRIM ) BRISTOL-MYERS SQUIBB CO.
XX

PI Oleson FB, Comereski CR;
XX

DR WPI; 1992-024199/03.
DR N-PSDB; AAQ20289.
XX

PT Use of transforming growth factor (TGF)-beta and their
PT antagonists - for modulating blood pressure, for treating
PT hypertension and hypotension
XX

PS Disclosure; Fig 1; 42pp; English.
XX

CC A new method for treating hypertension comprises administering a
CC transforming growth factor (TGF)-beta to an individual at a dose
CC effective for lowering blood pressure; the TGF-beta may be e.g.
CC mature TGF-beta, TGF-beta2, a mature TGF-beta1/beta2 hybrid, TGF-
CC beta1 precursor, a latent TGF-beta2 precursor, hybrid TGF-beta1/TGF-
CC beta2 precursor, a latent TGF-beta1 complex or a latent TGF-beta2
CC complex.
CC (Updated on 25-MAR-2003 to correct PA field.)
XX

SQ Sequence 390 AA;



Query Match 90.8%; Score 1919; DB 13; Length 390; 

Best Local Similarity 91.5%; Pred. No. 1.2e-162; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||||||||||| ||||||||||||||||||||||| ||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||:|||||||||||||||||:|   ||||||||| | ||||||| |:||||
Db 181 YLSNRLLAPSNSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 13
AAR46227

ID AAR46227 standard; Protein; 390 AA.
XX

AC AAR46227;
XX

DT 25-MAR-2003 (updated)
DT 09-JUL-1994 (first entry)
XX

DE Human pre-TGF-beta-1.
XX

KW TGF-beta-1; TGF-beta-2; transforming growth factor het<
KW transforming growth factor beta -3; recombinant; wound '.
KW vulnerary.
XX

OS Homo sapiens.
XX

FH Key Location/Qualifiers
FT Peptide 279..390
FT /label= Mat_peptide
FT Cleavage-site 279
FT /note= "TGF-beta-1 release site"
FT Modified-site 82..84
FT /label= N-glycosylation_site
FT Modified-site 136..138
FT /label= N-glycosylation_site
FT Modified-site 176..178
FT /label= N-glycosylation_site
XX

PN US5284763-A.
XX

PD 08-FEB-1994.
XX

PF 04-MAR-1992; 92US-0845893.
XX

PR 22-MAR-1985; 85US-0715142.
PR 13-MAR-1987; 87US-0025423.
PR 04-AUG-1989; 89US-0389929.
PR 04-MAR-1992; 92US-0845893.

XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV;
XX

DR WPI; 1994-056343/07.
DR N-PSDB; AAQ56923.
XX

PT Nucleic acid sequences encoding transforming growth factor-beta -
PT diagnostic probes, and for use in therapeutics
XX

PS Disclosure; Fig 1b; 25pp; English.
XX

CC cDNA sequences were determined for human pre-TGF-beta-1 (AAQ56923),
CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926), and the
CC corresponding amino acid sequences were determined (AAR46227-29,
CC respectively). A genomic fragment corresponding to a human TGF-
CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence
CC determined (AAR46230). The sequences have been used in the
CC construction of vectors for the expression of recombinant TGF-
CC beta.
CC (Updated on 25-MAR-2003 to correct PF field.)
XX

SQ Sequence 390 AA;

Query Match 90.6%; Score 1915; DB 15; Length 390; 

Best Local Similarity 91.5%; Pred. No. 2.8e-162; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 
Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| | ||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEREPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 14 
AAR83054 

ID AAR83054 standard; Protein; 391 AA. 
XX 

AC AAR83054;
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE Transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW interleukin-1-beta; transforming growth factor-beta; TGF-beta; IL1-beta;
KW nitric oxide production; hypotension; inflammation; septic shock;
KW treatment.

XX 

OS Mammalian sp. 
XX 

FH Key Location/Qualifiers 
FT Protein 279. .391 

FT /note= "represents the mature active TGF beta-1 mol."
XX

PN WO9526745-A1.
XX

PD 12-OCT-1995.
XX

PF 05-APR-1994; 94WO-US03705.
XX

PR 05-APR-1994; 94WO-US03705.
XX 

PA (HARD ) HARVARD COLLEGE. 
XX 

PI Lee M, Perrella MA; 
XX 

DR WPI; 1995-358443/46. 
DR N-PSDB; AAT05876. 
XX 

PT Treatment of hypotension, esp. in septic shock - by administering 

PT transforming growth factor-beta e.g. to inhibit inducible nitric 

PT oxide synthase gene transcription 

XX 

PS Disclosure; Fig 17; 52pp; English. 
XX 

CC Transforming growth factor-beta 1 (TGF-beta 1) has been found to inhibit
CC inducible nitric oxide synthase (iNOS) gene transcription, esp. in
CC interleukin-1-beta (IL1-beta) stimulated rat smooth muscle cells, and at
CC a dose which does not inhibit constitutive NOS. TGF-beta 1 or 2
CC (AAR83055) or their active fragments (esp. derived from the
CC carboxy-terminal 112 amino acids), can be used in the treatment of
CC hypotension, such as that associated with severe inflammation or septic
CC shock.
XX

SQ Sequence 391 AA;



Query Match 90.5%; Score 1913.5; DB 16; Length 391;
Best Local Similarity 91.5%; Pred. No. 3.8e-162;
Matches 365; Conservative 10; Mismatches 15; Indels 9; Gaps 2;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179
       ||||| : ||:|| |||||||||||||||||||||||| ||||||||||||||||||:||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 180

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 239
       |||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 240

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEK 299
        :|||||||||||||||||||||||||||||| ||||||        |||||||||||||
Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEK 292

Qy 300 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 352

Qy 360 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       |||||||||||||||||||||||||||||||||||||||
Db 353 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 391



RESULT 15
AAW78785

ID AAW78785 Standard; Protein; 390 AA.
XX

AC AAW78785;
XX

DT 25-MAR-2003 (updated)
DT 21-DEC-1998 (first entry)
XX

DE Human pre-transforming growth factor-beta 1.
XX

KW Transforming growth factor-beta 1; TGF-beta 1; human
XX

OS Homo sapiens.
XX

FH Key Location/Qualifiers
FT Domain 8..23
FT /note= "hydrophobic domain"
FT Protein 213..390
FT /label= Mat_protein
FT Modified-site 82..84
FT /note= "Asn is N-glycosylated"
FT Modified-site 136..138
FT /note= "Asn is N-glycosylated"
FT Modified-site 176..178
FT /note= "Asn is N-glycosylated"
FT Cleavage-site 277..278
FT /note= "cleavage site for release of TGF-beta 1"

XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998.
XX

PF 30-MAY-1995; 95US-0454468.
XX

PR 13-MAR-1987; 87US-0025423.
PR 22-MAR-1985; 85US-0715142.
PR 04-AUG-1989; 89US-0389929.
PR 04-MAR-1992; 92US-0845893.
PR 05-NOV-1993; 93US-0147364.
PR 30-MAY-1995; 95US-0454468.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR N-PSDB; AAV52933. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig 1B 1-3; 26pp; English.
XX

CC This is the amino acid sequence of human transforming growth
CC factor-beta 1 precursor (preTGF-beta 1). It was deduced from
CC a preTGF-beta 1 cDNA sequence (see AAV52933). The invention relates
CC to the recombinant production of TGF-beta. Biologically active
CC TGF-beta is defined as being capable of inducing EGF-potentiated

CC anchorage independent growth of target cell lines and/or growth 

CC inhibition of neoplastic cell lines. Nucleic acids encoding 

CC TGF-beta have been isolated and cloned into vectors which are 

CC replicated in bacteria and expressed in eukaryotic cells. TGF-beta 

CC recovered from transformed cells is used in known therapeutic 

CC applications. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 90.5%; Score 1913; DB 19; Length 390; 

Best Local Similarity 91.2%; Pred. No. 4.2e-162; 

Matches 363; Conservative 10; Mismatches 17; Indels 8; Gaps 1; 

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| || ||||||||||| ||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAPGLSTCKTIDMEQVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



Search completed: October 28, 2003, 09:06:45 
Job time : 36.2026 secs

GenCore version 5.1.6
Copyright (c) 1993 - 2003 Compugen Ltd.



OM protein - protein search, using sw model 



Run on:          October 28, 2003, 09:09:54 ; Search time 24.1473 Seconds
                 (without alignments)
                 2760.110 Million cell updates/sec

Title:           US-10-017-372E-9
Perfect score:   2114
Sequence:        1 MAPSGLRLLPLLLPLLWLLV..........GRKPKVEQLSNMIVRSCKCS 398

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        629382 seqs, 167460630 residues

Total number of hits satisfying chosen parameters: 629382



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0%

Maximum Match 100% 
Listing first 45 summaries 



Database 



Published_Applications_AA:*
1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US50_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
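
The Query Match and Best Local Similarity percentages printed with each result can be cross-checked against the other figures in this listing. The sketch below assumes, based only on the numbers shown here and not on any GenCore documentation, that Query Match is the hit score divided by the perfect score, and that Best Local Similarity is the number of identical matches divided by the total number of alignment columns (matches + conservative + mismatches + indels). The function names are hypothetical.

```python
def query_match_percent(score, perfect_score):
    """Hit score as a percentage of the perfect (self-match) score.

    Assumption inferred from this listing, not from GenCore documentation.
    """
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns.

    Assumption inferred from this listing, not from GenCore documentation.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # Figures taken from RESULT 1 of this search (perfect score 2114).
    print(query_match_percent(2035, 2114))
    print(best_local_similarity_percent(386, 1, 3, 8))
```

Under these assumptions the values reported for the listed hits are reproduced, which is a quick way to spot digits that were misread during scanning.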

SUMMARIES 



Result          Query
   No.   Score  Match  Length  DB  ID                  Description

     1    2035   96.3     390  11  US-09-214-592-26    Sequence 26, Appl
     2    1934   91.5     390  11  US-09-214-592-29    Sequence 29, Appl
     3    1931   91.3     390  15  US-10-087-268-2     Sequence 2, Appli
     4    1924   91.0     390  15  US-10-087-268-5     Sequence 5, Appli
     5    1921   90.9     390  12  US-10-276-947-1     Sequence 1, Appli
     6    1919   90.8     390  11  US-09-214-592-33    Sequence 33, Appl
     7  1913.5   90.5     391  11  US-09-214-592-17    Sequence 17, Appl
     8    1910   90.4     390  10  US-09-756-283A-23   Sequence 23, Appl
     9    1898   89.8     390  11  US-09-214-592-28    Sequence 28, Appl
    10    1840   87.0     390  11  US-09-214-592-20    Sequence 20, Appl
    11    1840   87.0     390  11  US-09-214-592-23    Sequence 23, Appl
    12    1577   74.6     315  11  US-09-214-592-25    Sequence 25, Appl
    13  1262.5   59.7     455  10  US-09-756-283A-20   Sequence 20, Appl
    14    1139   53.9     447  10  US-09-756-283A-22   Sequence 22, Appl
    15  1059.5   50.1     373  11  US-09-214-592-32    Sequence 32, Appl
    16     876   41.4     412  11  US-09-214-592-31    Sequence 31, Appl
    17     873   41.3     382  11  US-09-214-592-34    Sequence 34, Appl
    18   872.5   41.3     383  10  US-09-756-283A-27   Sequence 27, Appl
    19   838.5   39.7     409  11  US-09-214-592-27    Sequence 27, Appl
    20     835   39.5     410  11  US-09-214-592-22    Sequence 22, Appl
    21     834   39.5     412  11  US-09-214-592-24    Sequence 24, Appl
    22     832   39.4     412  11  US-09-214-592-19    Sequence 19, Appl
    23     832   39.4     412  14  US-10-028-158-21    Sequence 21, Appl
    24   831.5   39.3     414  11  US-09-214-592-21    Sequence 21, Appl
    25     828   39.2     412  10  US-09-756-283A-25   Sequence 25, Appl
    26   816.5   38.6     414  10  US-09-756-283A-24   Sequence 24, Appl
    27   81b.b   38.6     414  11  US-09-214-592-18    Sequence 18, Appl
    28   815.5   38.6     412  11  US-09-214-592-30    Sequence 30, Appl
    29   804.5   38.1     304  10  US-09-756-283A-26   Sequence 26, Appl
    30     758   35.9     139  14  US-10-002-278-8     Sequence 8, Appli
    31     642   30.4     115  10  US-09-859-211-47    Sequence 47, Appl
    32     642   30.4     115  10  US-09-880-708-25    Sequence 25, Appl
    33     642   30.4     115  11  US-09-872-856-47    Sequence 47, Appl
    34     642   30.4     115  15  US-10-335-483-29    Sequence 29, Appl
    35     640   30.3     114  10  US-09-813-459-22    Sequence 22, Appl
    36       ?      ?       ?   ?  ?                   ?
    37     640      ?       ?   ?  US-10-154-?         ?
    38     638   30.2     112  10  US-09-813-271B-2    Sequence 2, Appli
    39     638   30.2     113  10  US-09-813-398-13    Sequence 13, Appl
    40     562   26.6      98  12  US-10-187-394-1     Sequence 1, Appli
    41     544   25.7     116  14  US-10-115-406-24    Sequence 24, Appl
    42     544   25.7     116  15  US-10-154-333-26    Sequence 26, Appl
    43     535   25.3     112  10  US-09-813-271B-8    Sequence 8, Appli
    44     505   23.9     114  14  US-10-115-406-25    Sequence 25, Appl
    45     505   23.9     114  15  US-10-154-333-27    Sequence 27, Appl



ALIGNMENTS 



RESULT 1 

US-09-214-592-26 

Sequence 26, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 



APPLICANT: Yamasaki, CMotoo
APPLICANT: Shibata, CKenji
APPLICANT: Sato, CYasufumi
TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
FILE REFERENCE: 11060
CURRENT APPLICATION NUMBER: US/09/214,592A
CURRENT FILING DATE: 1999-01-18
NUMBER OF SEQ ID NOS: 34
SOFTWARE : 
SEQ ID NO 26 
LENGTH: 390 
TYPE: PRT 
ORGANISM: porcine 
US-09-214-592-26 

Query Match 96.3%; Score 2035; DB 11; Length 390;
Best Local Similarity 97.0%; Pred. No. 1.9e-182;
Matches 386; Conservative 1; Mismatches 3; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       |||||||||||||||||||||||||||||||||||||||||||||||||||||:||||||
Db  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       |||||||||||||||||||||||||||||||||||||||||| | |||||||||||||||
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       ||||||||||||||||||||||||||||||||||||||        ||||||||||||||
Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 2 

US-09-214-592-29 

Sequence 29, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 
APPLICANT: Yamasaki, CMotoo
APPLICANT: Shibata, CKenji
APPLICANT: Sato, CYasufumi
TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
FILE REFERENCE: 11060
CURRENT APPLICATION NUMBER: US/09/214,592A
CURRENT FILING DATE: 1999-01-18
NUMBER OF SEQ ID NOS: 34
SOFTWARE:
SEQ ID NO 29
LENGTH: 390
TYPE: PRT 
ORGANISM: ovine 
US-09-214-592-29 



Query Match 91.5%; Score 1934; DB 11 

Best Local Similarity 91.7%; Pred. No. 5.9e-173 
Matches 365; Conservative 10; Mismatches 15 



Length 390; 
Indels 8; Gaps 



1; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 



I llllllllllllllllhllllll lllllllllllllllll lllllllllllllll 

MP PSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKT I DMELVKRKG I EAI RGQI LSKLRIA 

SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 
lllllllllllllllhlllllllllllllll I lllllllllillllllllll 



llllllillllllllllllllllllllil II lilllllM I lllllll hillh 
YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEI EGFRLSAHCSCDSKDNTLQVDI NGFS 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 

IIIIIIMIIIIIillMIIIIIIIIIMIIIIIIMI llllllllllllll 



Db 241 SGRRGDIATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360

IIMIMIIIIIIIIIIIIIIIIMIMIIIIIIIIIIMIIMIIIIIIIIIIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMII 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 3 
US-10-087-268-2 

; Sequence 2, Application US/10087268
; Publication No. US20030119010A1
; GENERAL INFORMATION:
; APPLICANT: Jonsonn, Julie Ruth
; APPLICANT: Powell, Elizabeth Ellen
; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or condition
; FILE REFERENCE: Fibrosis
; CURRENT APPLICATION NUMBER: US/10/087,268
; CURRENT FILING DATE: 2002-03-01
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 390
; TYPE: PRT
; ORGANISM: Human
US-10-087-268-2



Query Match 91.3%; Score 1931; DB 15; Length 390;
Best Local Similarity 92.0%; Pred. No. 1.1e-172;
Matches 366; Conservative 10; Mismatches 14; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:|||| 
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 4 
US-10-087-268-5 

; Sequence 5, Application US/10087268
; Publication No. US20030119010A1
; GENERAL INFORMATION:
; APPLICANT: Jonsonn, Julie Ruth
; APPLICANT: Powell, Elizabeth Ellen
; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or condition
; FILE REFERENCE: Fibrosis
; CURRENT APPLICATION NUMBER: US/10/087,268
; CURRENT FILING DATE: 2002-03-01
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 5
; LENGTH: 390
; TYPE: PRT
; ORGANISM: Human
US-10-087-268-5



Query Match 91.0%; Score 1924; DB 15; Length 390;
Best Local Similarity 91.7%; Pred. No. 5.1e-172;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:|||| 
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 5 
US-10-276-947-1 

; Sequence 1, Application US/10276947
; Publication No. US20030176315A1
; GENERAL INFORMATION:
; APPLICANT: GLAXO GROUP LIMITED
; TITLE OF INVENTION: MEDICAL PRODUCTS
; FILE REFERENCE: PG3949
; CURRENT APPLICATION NUMBER: US/10/276,947
; CURRENT FILING DATE: 2002-11-21
; PRIOR APPLICATION NUMBER: GB 0012991.6
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: GB 0100286.4
; PRIOR FILING DATE: 2001-01-05
; NUMBER OF SEQ ID NOS: 1
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 1
; LENGTH: 390
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-276-947-1

Query Match 90.9%; Score 1921; DB 12; Length 390;
Best Local Similarity 91.7%; Pred. No. 9.8e-172;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:|||| 
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390





RESULT 6 

US-09-214-592-33 

; Sequence 33, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 33
; LENGTH: 390
; TYPE: PRT
; ORGANISM: Simian
US-09-214-592-33



Query Match 90.8%; Score 1919; DB 11; Length 390;
Best Local Similarity 91.5%; Pred. No. 1.5e-171;
Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||||||||||| ||||||||||||||||||||||| ||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : ||:|| ||||||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||:|||||||||||||||||:|   ||||||||| | ||||||| |:|||| 
Db 181 YLSNRLLAPSNSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390





RESULT 7 



US-09-214-592-17 

; Sequence 17, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 17
; LENGTH: 391
; TYPE: PRT
; ORGANISM: human
US-09-214-592-17



Query Match 90.5%; Score 1913.5; DB 11; Length 391;
Best Local Similarity 91.5%; Pred. No. 5e-171;
Matches 365; Conservative 10; Mismatches 15; Indels 9; Gaps 2;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179
       ||||| : ||:|| |||||||||||||||||||||||| ||||||||||||||||||:||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 180

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 239
       |||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 240

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEK 299
        :|||||||||||||||||||||||||||||| ||||||        |||||||||||||
Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEK 292

Qy 300 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 352

Qy 360 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       |||||||||||||||||||||||||||||||||||||||
Db 353 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 391





RESULT 8 

US-09-756-283A-23 

; Sequence 23, Application US/09756283A
; Patent No. US20020151478A1
; GENERAL INFORMATION:
; APPLICANT: Chernajovsky, Yuti
; APPLICANT: Dreja, Hanna Stina
; APPLICANT: Adams, Gillian
; TITLE OF INVENTION: Latent Fusion Protein
; FILE REFERENCE: 0623.1000000
; CURRENT APPLICATION NUMBER: US/09/756,283A
; CURRENT FILING DATE: 2001-01-09
; NUMBER OF SEQ ID NOS: 100
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 23
; LENGTH: 390
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-756-283A-23



Query Match 90.4%; Score 1910; DB 10; Length 390;
Best Local Similarity 91.2%; Pred. No. 1.1e-170;
Matches 363; Conservative 10; Mismatches 17; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: ::|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHHEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||||| : || || || |||||||||||||||||||||||||||||||||||||||:|||
Db 121 YDKFKQSTHSTYMFFNISELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       ||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:|||| 
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       :|||||||||||||||||||||||||||||| ||||||        ||||||||||||||
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390





RESULT 9 

US-09-214-592-28 

; Sequence 28, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 28
; LENGTH: 390
; TYPE: PRT
; ORGANISM: canine
US-09-214-592-28

Query Match 89.8%; Score 1898; DB 11; Length 390;
Best Local Similarity 90.7%; Pred. No. 1.4e-169;
Matches 361; Conservative 12; Mismatches 17; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||| ||||||||||||||||||||||||||||||||||||||||||:
Db   1 MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLS 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||| ||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       |:| | :|||:|||||||||||||||||||||||||||||||| ||||||||||||||||
Db 121 YEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       |||||||||||:||||||||||||||||:    :|||||||| | ||||||| |:||||:
Db 181 YLSNRLLAPSDTPEWLSFDVTGVVRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
       | ||||||||||||||||||||||||||||||||| ||        ||||||||||||||
Db 241 SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       ||||||||||||||||||||||||||||||||||||||
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 10 
US-09-214-592-20 

; Sequence 20, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 20
; LENGTH: 390
; TYPE: PRT
; ORGANISM: murine
US-09-214-592-20

Query Match 87.0%; Score 1840; DB 11; Length 390;
Best Local Similarity 86.9%; Pred. No. 3.9e-164;
Matches 346; Conservative 15; Mismatches 29; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| :||||||||||||||||||||:  | |
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       |:| |   ||:|| ||||::||||||| ||||||||| |||  |||||||||||||:|||
Db 121 YEKTKDISHSIYMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       || |||| |:|:|||||||||||||||| : : |:||| ||| | ||||| ||||||| :
Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
         ||||| ||| ||||||||||||||||||||||||||        ||||||||||||||
Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       :|||||||||||||||||||||||||||||||||||||
Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 11 
US-09-214-592-23
; Sequence 23, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 23
; LENGTH: 390
; TYPE: PRT
; ORGANISM: rat
US-09-214-592-23

Query Match 87.0%; Score 1840; DB 11; Length 390;
Best Local Similarity 86.9%; Pred. No. 3.9e-164;
Matches 346; Conservative 14; Mismatches 30; Indels 8; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| :||||||||||||||||||||:  | |
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
       ||| |   ||:|| ||||::||||||| ||||||||| | |  |||||||||||||:|||
Db 121 YDKTKDITHSIYMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
       || |||| |:|:|||||||||||||||| : : |:||| ||| | ||||| ||||||| :
Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
         ||||| ||| ||||||||||||||||||||||||||        ||||||||||||||
Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
       :|||||||||||||||||||||||||||||||||||||
Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 12 

US-09-214-592-25 

; Sequence 25, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 25
; LENGTH: 315
; TYPE: PRT
; ORGANISM: bovine
US-09-214-592-25

Query Match 74.6%; Score 1577; DB 11; Length 315;
Best Local Similarity 91.3%; Pred. No. 1.4e-139;
Matches 295; Conservative 9; Mismatches 11; Indels 8; Gaps 1;

Qy  76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135
       |:||||||||||||||| | |||||||||||||||||||| ||:|||| | : ||:|| |
Db   1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFF 60

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195
       ||||||||||||||||||::|||||||||||||||||||||:||||||||||||||||||
Db  61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120

Qy 196 LSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255
       ||||||||||||||||| ||||||||| | ||||||| |:||||:|||||||||||||||
Db 121 LSFDVTGVVRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180

Qy 256 PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLG 315
       |||||||||||||||||||||||        |||||||||||||||||||||||||||||
Db 181 PFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKNCCVRQLYIDFRKDLG 232

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 375
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 233 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 292

Qy 376 YYVGRKPKVEQLSNMIVRSCKCS 398
       |||||||||||||||||||||||
Db 293 YYVGRKPKVEQLSNMIVRSCKCS 315



RESULT 13 
US-09-756-283A-20 

; Sequence 20, Application US/09756283A
; Patent No. US20020151478A1
; GENERAL INFORMATION:
; APPLICANT: Chernajovsky, Yuti
; APPLICANT: Dreja, Hanna Stina
; APPLICANT: Adams, Gillian
; TITLE OF INVENTION: Latent Fusion Protein
; FILE REFERENCE: 0623.1000000
; CURRENT APPLICATION NUMBER: US/09/756,283A
; CURRENT FILING DATE: 2001-01-09
; NUMBER OF SEQ ID NOS: 100
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 20
; LENGTH: 455
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: LAP-mIFN construct
US-09-756-283A-20



Query Match 59.7%; Score 1262.5; DB 10; Length 455;
Best Local Similarity 90.5%; Pred. No. 8.2e-110;
Matches 248; Conservative 10; Mismatches 15; Indels 1; Gaps 1;

Qy   1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
       | |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||
Db   1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

Qy  61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
       ||||||:||||||||||||||||||||||||| ||||||||||||||||||||||: |:|
Db  61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179
       ||||| : ||:|| |||||||||||||||||||||||| ||||||||||||||||||:||
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 180

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 239
       |||||||||||||||||||||||||||||:|   ||||||||| | ||:|||| |:||||
Db 181 RYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 240

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHS 273
        :|||||||||||||||||||||||||||||| |
Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQS 274



RESULT 14 
US-09-756-283A-22 

; Sequence 22, Application US/09756283A
; Patent No. US20020151478A1
; GENERAL INFORMATION:
; APPLICANT: Chernajovsky, Yuti
; APPLICANT: Dreja, Hanna Stina
; APPLICANT: Adams, Gillian
; TITLE OF INVENTION: Latent Fusion Protein
; FILE REFERENCE: 0623.1000000
; CURRENT APPLICATION NUMBER: US/09/756,283A
; CURRENT FILING DATE: 2001-01-09
; NUMBER OF SEQ ID NOS: 100
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 22
; LENGTH: 447
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: mIFN-LAP construct
US-09-756-283A-22

Query Match 53.9%; Score 1139; DB 10; Length 447;
Best Local Similarity 87.0%; Pred. No. 3.1e-98;
Matches 228; Conservative 10; Mismatches 18; Indels 6; Gaps 3;

Qy  14 PL-LWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGP 72
       || ||      |  || |||||||||||||||||||||||||||||||||||||:|||||
Db 190 PLGLW----AGGGSAAALSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGEVPPGP 245

Qy  73 LPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLY 132
       |||||||||||||||||||| ||||||||||||||||||||||: |:|||||| : ||:|
Db 246 LPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEIYDKFKQSTHSIY 305

Qy 133 MLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSD 191
       | |||||||||||||||||||||||| ||||||||||||||||||:||||||||||||||
Db 306 MFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSD 365

Qy 192 SPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIH 251
       |||||||||||||||||:|   ||||||||| | ||:|||| |:|||| :||||||||||
Db 366 SPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFTTGRRGDLATIH 425

Qy 252 GMNRPFLLLMATPLERAQHLHS 273
       |||||||||||||||||||| |
Db 426 GMNRPFLLLMATPLERAQHLQS 447



RESULT 15 

US-09-214-592-32
; Sequence 32, Application US/09214592A
; Publication No. US20030027218A1
; GENERAL INFORMATION:
; APPLICANT: Yamasaki, CMotoo
; APPLICANT: Shibata, CKenji
; APPLICANT: Sato, CYasufumi
; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND METHOD
; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS
; FILE REFERENCE: 11060
; CURRENT APPLICATION NUMBER: US/09/214,592A
; CURRENT FILING DATE: 1999-01-18
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE:
; SEQ ID NO 32
; LENGTH: 373
; TYPE: PRT
; ORGANISM: chicken
US-09-214-592-32

Query Match 50.1%; Score 1059.5; DB 11; Length 373; 

Best Local Similarity 54.5%; Pred. No. 7e-91; 

Matches 208; Conservative 52; Mismatches 99; Indels 23; Gaps 8; 
Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

IMh :|:| hMllhllllMMII ••M II IIIU MINI-- ^ 

Db 2 LSTCQRLDLEAAKKKRIEAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES-VEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPV 148

: : I h :hllh hi:: : :: Ih: :l|l I I 
Db 62 QRARLRPPPDGPDEYWAKELRRIPMETTWDGAMEHWQPQSHSIFFVFNVSRARRG-GRPT 120 

Qy 149 LLSRAELRLLRLKLK-----VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 203

II llllhll I II :|||l I I Mill I : : Hllllll 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 180



Qy 204 VRQWLTRREAIEGFRLSAHSSSD---SKDNTLHVEINGFNSGRRGDLATIHGMNR--PFL 258

I llh I : hill : : : III Ulh :| :| I- 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMQSIAKKHRRVPYV 239 



Qy 259 LLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCF--SSTEKNCCVRQLYIDFRKDLGW 316 

I II I III llhl III Mhlll : lllllll IIIIMIII I 

Db 240 LAMALPAERANELHSARRRRD--------LDTDYCFGPGTDEKNCCVRPLYIDFRKDLQW 291



Qy 317 KWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVY 376

IIIIMIII lllhllllllll lllhllllllllllllllllMIIII hlllhl 

Db 292 KWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPIIY 351



Qy 377 YVGRKPKVEQLSNMIVRSCKCS 398 

III! :|lllllhM:||ll 
Db 352 YVGRNVRVEQLSNMVVRACKCS 373



Search completed: October 28, 2003, 09:27:59 
Job time : 25.1473 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:          October 28, 2003, 00:37:16 ; Search time 13.1517 Seconds
                 (without alignments)
                 2910.285 Million cell updates/sec

Title:           US-10-017-372E-9
Perfect score:   2114
Sequence:        1 MAPSGLRLLPLLLPLLWLLV...GRKPKVEQLSNMIVRSCKCS 398

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        283308 seqs, 96168682 residues

Total number of hits satisfying chosen parameters: 283308

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database:        PIR_76:*
                 pir1:*
                 pir2:*
                 pir3:*
                 pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
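
The per-result summary figures in this listing can be cross-checked against each other. The sketch below rests on an assumption inferred from the printed numbers, not on documented GenCore behaviour: Query Match appears to be the score divided by the perfect score (2114 for this query), and Best Local Similarity appears to be the number of matches divided by the number of aligned columns (matches + conservative + mismatches + indels). The function names are hypothetical.

```python
def query_match(score, perfect_score):
    """Score as a percentage of the query's perfect (self-alignment) score."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all aligned columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures copied from RESULT 1 of this PIR_76 search (entry A27512).
PERFECT_SCORE = 2114
print(round(query_match(2038, PERFECT_SCORE), 1))       # reported as 96.4%
print(round(best_local_similarity(387, 0, 3, 8), 1))    # reported as 97.2%
```

A disagreement between a recomputed and a printed value is a quick way to spot a misread digit in a scanned copy of the report.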
ALIGNMENTS



RESULT 1 



A27512
transforming growth factor beta-1 precursor - pig
N;Alternate names: TGF-beta
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 05-Jun-1988 #sequence_revision 05-Jun-1988 #text_change 16-Jul-1999
C;Accession: A27512; A26356; I46657
R;Derynck, R.; Rhee, L.
Nucleic Acids Res. 15, 3187, 1987
A;Title: Sequence of the porcine transforming growth factor-beta precursor.
A;Reference number: A27512; MUID:87174844; PMID:3470708
A;Accession: A27512
A;Molecule type: mRNA
A;Residues: 1-390 <DER>
R;Cheifetz, S.; Weatherbee, J.A.; Tsang, M.L.S.; Anderson, J.K.; Mole, J.E.; Lucas, R.; Massague, J.
Cell 48, 409-415, 1987
A;Title: The transforming growth factor-beta system, a complex pattern of cross-reactive ligands and receptors.
A;Reference number: A90890; MUID:87102890; PMID:2879635
A;Accession: A26356
A;Molecule type: protein
A;Residues: 279-322 <CHE>
R;Kondaiah, P.; Van Obberghen-Schilling, E.; Ludwig, R.L.; Dhar, R.; Sporn, M.B.; Roberts, A.B.
J. Biol. Chem. 263, 18313-18317, 1988
A;Title: cDNA cloning of porcine transforming growth factor-beta 1 mRNAs. Evidence for alternate splicing and polyadenylation.
A;Reference number: I46657; MUID:89054010; PMID:2461367
A;Accession: I46657
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-390 <KON>
A;Cross-references: GB:M23703; NID:g755044; PIDN:AAA64616.1; PID:g755045
C;Genetics:
A;Gene: TGFB; TGF-beta-1
C;Superfamily: inhibin
C;Keywords: growth factor

Query Match 96.4%; Score 2038; DB 2; Length 390;
Best Local Similarity 97.2%; Pred. No. 8.5e-156;
Matches 387; Conservative 0; Mismatches 3; Indels 8; Gaps 1;

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII 
Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL?CLKVEQHVELYQKYSNDSWR 18 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 
Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

IIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIMIII I lllllllllllllll 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIMIIIIII 
Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKN 2 92 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 
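
The summary figures printed with each hit appear to be internally consistent: in every record of this listing, "Best Local Similarity" equals Matches divided by the sum of Matches, Conservative, Mismatches and Indels. This relationship is inferred from the numbers shown here, not taken from the search program's documentation, so the sketch below is only a hedged sanity check. The function names `parse_summary` and `local_similarity` are hypothetical.

```python
import re


def parse_summary(text):
    """Extract the per-column counts from a 'Matches ...;' summary line.

    'Gaps' is deliberately not captured: it counts gap runs, not columns.
    """
    pairs = re.findall(r"(Matches|Conservative|Mismatches|Indels)\s+(\d+)", text)
    return {name: int(value) for name, value in pairs}


def local_similarity(counts):
    """Percent of aligned columns that are exact matches (assumed definition)."""
    columns = sum(counts.values())
    return 100.0 * counts["Matches"] / columns


# Summary line of RESULT 1 as printed above.
line = "Matches 387; Conservative 0; Mismatches 3; Indels 8; Gaps 1;"
counts = parse_summary(line)
similarity = local_similarity(counts)
print(f"{sum(counts.values())} columns, {similarity:.1f}% identical")
```

For RESULT 1 this gives 398 aligned columns, which is also the query length, and reproduces the reported 97.2%.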



RESULT 2 
S01413 

transforming growth factor beta-1 precursor - chicken 

C;Species: Gallus gallus (chicken) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 24-Nov-1999 
C;Accession: S01413

R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A.B. 
Nucleic Acids Res. 16, 8730, 1988 

A;Title: Nucleotide sequence of chicken transforming growth factor-beta 1
(TGF-beta 1).
A;Reference number: S01413; MUID:88335639; PMID:3166520
A;Accession: S01413
A;Molecule type: DNA
A;Residues: 1-391 <JAK>
A;Cross-references: EMBL:X12373; NID:g63808; PIDN:CAA30933.1; PID:g63809
C;Superfamily: inhibin
C;Keywords: growth factor

Query Match 94.9%; Score 2006.5; DB 2; Length 391;

Best Local Similarity 96.2%; Pred. No. 2.9e-153; 

Matches 384; Conservative 0; Mismatches 6; Indels 9; Gaps 2; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I III lllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

IIIIIIIIIMIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII lllll 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 
Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWG 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEIN-GF 239

IIIIIIIIIIIIMIIIIIMIIIMIIIMIIIIIIIIIM I IIIIMIIIIII II 

Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINAGF 240

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEK 299 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIII IMIIIIIIIill 

Db 241 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEK 292

Qy 300 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359 



IIMIIIIMIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIMIIIIIIIIIIIIII 

Db 293 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 352 

Qy 360 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398

lllllllllllllllllllllllllllllllllllllll 
Db 353 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 391 



RESULT 3 
I46463

transforming growth factor beta-1 - sheep 

C;Species: Ovis orientalis aries, Ovis ammon aries (domestic sheep)
C;Date: 19-Dec-1997 #sequence_revision 19-Dec-1997 #text_change 24-Nov-1999
C;Accession: I46463; S45115
R;Woodall, C.J.; McLaren, L.J.; Watt, N.J.
Gene 150, 371-373, 1994
A;Title: Sequence and chromosomal localisation of the gene encoding ovine latent
transforming growth factor-beta 1.
A;Reference number: I46463; MUID:95121932; PMID:7821809
A;Accession: I46463
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-390 <WOO>
A;Cross-references: EMBL:X76916; NID:g496648; PIDN:CAA54242.1; PID:g496649
A;Note: submitted to the EMBL Data Library, December 1993
C;Superfamily: inhibin

Query Match 91.5%; Score 1934; DB 2; Length 390;

Best Local Similarity 91.7%; Pred. No. 1.9e-147; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I Mllllllllllllllhllllll lllllllllllllllll lllllllllllllll 
Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTIDMELVKRKGIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

IIIIIIMIIIIIIIhlllllllllllllll I llllllllllllllllllll Ihl 
Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

III I : Ihll llllllllllllllllllhHIIIIIIIIIIIIIMMIIhlll 
Db 121 YDKMKSSSHSIYMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

IIIIIIIIIMIIIIIIIIIIIIIIIIII II lllllllll I lllllll hlllh 
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

llllllllllllllllllllllllllllllllllllll IIIIIIIIMIIII 
Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIII 
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIIMMMIIIIIIIIIIIIIIIIIIIIIIII 



Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 4 
WFHU2 

transforming growth factor beta-1 precursor [validated] - human 
N;Alternate names: growth-inhibitory factor; TGF type 2; TGF-beta 
C;Species: Homo sapiens (man)
C;Date: 28-Feb-1986 #sequence_revision 19-Oct-1995 #text_change 08-Dec-2000
C;Accession: A27513; A01395; A22290; I59664; S53444
R;Derynck, R.; Rhee, L.; Chen, E.Y.; Van Tilburg, A.
Nucleic Acids Res. 15, 3188-3189, 1987
A;Title: Intron-exon structure of the human transforming growth factor-beta
precursor gene.
A;Reference number: A27513; MUID:87174845; PMID:3470709
A;Accession: A27513
A;Molecule type: DNA
A;Residues: 1-390 <DER>
A;Cross-references: GB:X05839; GB:Y00112; NID:g37097; PIDN:CAA29283.1;
PID:g1212989

R;Derynck, R.; Jarrett, J.A.; Chen, E.Y.; Eaton, D.H.; Bell, J.R.; Assoian,
R.K.; Roberts, A.B.; Sporn, M.B.; Goeddel, D.V.
Nature 316, 701-705, 1985
A;Title: Human transforming growth factor-beta complementary DNA sequence and
expression in normal and transformed cells.
A;Reference number: A01395; MUID:85296301; PMID:3861940
A;Accession: A01395
A;Molecule type: mRNA
A;Residues: 1-9,'P',11-24,'P',26-159,'R',160-390 <DE2>
A;Cross-references: GB:X02812; GB:J05114; NID:g37092; PIDN:CAA26580.1;
PID:g37093
A;Note: the authors suggest that residues 8-23 could represent the hydrophobic
core of an amino-terminal signal peptide

R;Massague, J.; Like, B.
J. Biol. Chem. 260, 2636-2645, 1985
A;Title: Cellular receptors for type beta transforming growth factor. Ligand
binding and affinity labeling in human and rodent cell lines.
A;Reference number: A22290; MUID:85131019; PMID:2982829
A;Accession: A22290
A;Molecule type: protein
A;Residues: 279-295,'XX',298-301 <MAS>

R;Urushizaki, Y.; Niitsu, Y.; Terui, T.; Koshida, Y.; Mahara, K.; Kohgo, Y.;
Urushizaki, I.; Takahashi, Y.; Ito, H.
Tumor Res. 22, 41-55, 1987
A;Title: Cloning and expression of the gene for human transforming growth
factor-beta in Escherichia coli.
A;Reference number: I59664
A;Accession: I59664
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 279-390 <RES>
A;Cross-references: GB:M38449; NID:g339557; PIDN:AAA36735.1; PID:g339558
R;Stam, K.; Stewart, A.A.; Qu, G.Y.; Iwata, K.K.; Fenyoe, D.; Chait, B.T.;
Marshak, D.R.; Haley, J.D.
Biochem. J. 305, 87-92, 1995
A;Title: Physical and biological characterization of a growth-inhibitory
activity purified from the neuroepithelioma cell line A673.
A;Reference number: S53444; MUID:95126934; PMID:7826358
A;Accession: S53444
A;Status: preliminary
A;Molecule type: protein
A;Residues: 279-297 <STA>

C;Comment: The mature protein is the carboxyl-terminal segment of a precursor
polypeptide; the active molecule is a dimer of identical polypeptide chains
linked by an interchain disulfide bond.
C;Genetics:
A;Gene: GDB:TGFB1; TGFB
A;Cross-references: GDB:120729; OMIM:190180
A;Map position: 19q13.2-19q13.2
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor; homodimer; mitogen; transformation
F;1-18/Domain: signal sequence #status predicted <SIG>
F;19-278/Domain: propeptide #status predicted <PRO>
F;244-246/Region: cell attachment (R-G-D) motif
F;279-390/Product: transforming growth factor beta-1 #status experimental <MAT>
F;82,136,176/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 90.9%; Score 1921; DB 1; Length 390; 

Best Local Similarity 91.7%; Pred. No. 2.1e-146; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIMIII IIIMIIIIIIIIIIIIIIIIIIMIMIIIMIIMIIIIIIIIIMII 

Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

lllllhlllllMIMIIIIIIMIIIIIII IIIMIIMIIMIIIIIIIIh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

UNI : Ihl! IIIMIIIIIIMIIIIIIMIIIMIMMIIMIIIIIIhlll 

Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

MIIIIIIIIMIIIIIIIIMIIMIhl lllllllll I Ihllll hllll 

Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

UIMMIIIIIIIIIIIMIIIIMIIIII IIIIM IMMIIIMMII 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIIIIIIIIIIIIIMIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MlllillMIMIIIIIIIIIIMIIIIIIIIMMI 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 
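
The `F;` feature lines in the human record (WFHU2) encode residue ranges followed by a feature type and a description. As a rough illustration of how such lines could be read programmatically, the sketch below parses a few of them and derives segment lengths. The line layout is inferred from this listing alone, the input strings are the record's lines with the scan's stray spacing normalised, and `parse_feature` is a hypothetical helper, not part of any PIR tooling.

```python
import re

# positions / feature type : free-text description
FEATURE = re.compile(r"^F;([\d,\-]+)/([^:]+):\s*(.*)$")


def parse_feature(line):
    """Split an 'F;' line into (type, description, [(start, end), ...])."""
    match = FEATURE.match(line.strip())
    if match is None:
        raise ValueError(f"not a feature line: {line!r}")
    positions, kind, description = match.groups()
    spans = []
    for part in positions.split(","):
        start, _, end = part.partition("-")
        # A single position such as '82' is treated as a one-residue span.
        spans.append((int(start), int(end or start)))
    return kind.strip(), description.strip(), spans


lines = [
    "F;19-278/Domain: propeptide #status predicted <PRO>",
    "F;244-246/Region: cell attachment (R-G-D) motif",
    "F;279-390/Product: transforming growth factor beta-1 #status experimental <MAT>",
    "F;82,136,176/Binding site: carbohydrate (Asn) (covalent) #status predicted",
]
features = [parse_feature(line) for line in lines]

# Total residues covered by each feature.
lengths = [sum(end - start + 1 for start, end in spans) for _, _, spans in features]
for (kind, description, _), length in zip(features, lengths):
    print(f"{kind:12s} {length:4d}  {description}")
```

Read this way, the mature product spans 112 residues (279-390) and the R-G-D motif covers three.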



RESULT 5 

A26960 

transforming growth factor beta-1 precursor - green monkey 
C;Species: Cercopithecus aethiops (green monkey, grivet)
C;Date: 05-Oct-1988 #sequence_revision 05-Oct-1988 #text_change 24-Nov-1999
C;Accession: A26960
R;Sharples, K.; Plowman, G.D.; Rose, T.M.; Twardzik, D.R.; Purchio, A.F.
DNA 6, 239-244, 1987
A;Title: Cloning and sequence analysis of simian transforming growth factor-beta
cDNA.
A;Reference number: A26960; MUID:87246074; PMID:3474130
A;Accession: A26960
A;Molecule type: mRNA
A;Residues: 1-390 <SHA>
A;Cross-references: GB:M16658; NID:g176552; PIDN:AAA35369.1; PID:g176553
C;Superfamily: inhibin
C;Keywords: growth factor
F;1-16/Domain: signal sequence #status predicted <SIG>
F;17-390/Product: transforming growth factor beta #status predicted <MAT>

Query Match 90.8%; Score 1919; DB 2; Length 390; 

Best Local Similarity 91.5%; Pred. No. 3e-146; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIMIIIIIIIIIIMIIM IIIIIIMIIIIIIIIIIIIIII IIIMIIIIIII 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

I I I I I I : I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h I :| 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

Mill : Ihll IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIhlll 
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

lllllllllhlllllllllllllMlhl lllllllll I lllllll hill! 

Db 181 YLSNRLLAPSNSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

:||IIIIIIIIIMIIIIIIIIIIIIIIIII llllll llllllllllllll 
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360

II Ml lllllllll I III IIIIMII I Mil III III Mil II I II I III lllllll Ml 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIII 
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 6 
JC4023 

transforming growth factor beta-1 - dog 
C;Species: Canis lupus familiaris (dog)
C;Date: 13-Jun-1995 #sequence_revision 14-Jul-1995 #text_change 24-Nov-1999
C;Accession: JC4023
R;Manning, A.M.; Auchampach, J.A.; Drong, R.F.; Slightom, J.L.
Gene 155, 307-308, 1995
A;Title: Cloning of a canine cDNA homologous to the human transforming growth
factor-beta 1-encoding gene.
A;Reference number: JC4023; MUID:95237630; PMID:7721110
A;Accession: JC4023
A;Molecule type: mRNA
A;Residues: 1-390 <MAN>
A;Cross-references: GB:L34956; NID:g516071; PIDN:AAA51458.1; PID:g516072
C;Comment: This factor plays a multifunctional role as a regulator of mammalian
cell growth and as a modulator of immune responses.
C;Genetics:
A;Gene: tgf-beta1
C;Superfamily: inhibin
C;Keywords: growth factor; transforming protein
F;288-390/Product: transforming growth factor beta 1 #status predicted <MAT>

Query Match 89.8%; Score 1898; DB 2; Length 390;

Best Local Similarity 90.7%; Pred. No. 1.5e-144; 

Matches 361; Conservative 12; Mismatches 17; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I llllllllllllll IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMI: 

Db 1 MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLS 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

lllllhIM lllllllllllllllllllll llllllllllllllllllllll: hi 
Db 61 SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

hi I UlhlMMIIMMIMMMMIMIMIIIIII IMMIIMIMMII 

Db 121 YEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

IIIMIMIIhMIMIMIMMIIh :IMIMM I IIIIMI hlllh 

Db 181 YLSNRLLAPSDTPEWLSFDVTGVVRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

I MIMMMMIIMIMIMIMIMMIMI M MIMMIIMMI 

Db 241 SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIMMIIMMIIIIMMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MMIMIMIMIMMIMIMIIMMIIIIMM 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 7 
WFMS2 

transforming growth factor beta-1 precursor - mouse 
N;Alternate names: TGF type 2; TGF-beta
C;Species: Mus musculus (house mouse)
C;Date: 04-Dec-1986 #sequence_revision 04-Dec-1986 #text_change 24-Nov-1999
C;Accession: A01396
R;Derynck, R.; Jarrett, J.A.; Chen, E.Y.; Goeddel, D.V.
J. Biol. Chem. 261, 4377-4379, 1986
A;Title: The murine transforming growth factor-beta precursor.
A;Reference number: A01396; MUID:86168129; PMID:3007454
A;Accession: A01396
A;Molecule type: mRNA
A;Residues: 1-390 <DER>
A;Cross-references: GB:M13177; NID:g201952; PIDN:AAA40423.1; PID:g201953
A;Note: the authors suggest that residues 8-23 could represent the hydrophobic
core of an amino-terminal signal peptide
C;Comment: The mature protein is the carboxyl-terminal segment of a precursor
polypeptide; the active molecule is a dimer of identical polypeptide chains
linked by an interchain disulfide bond.
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen;
transformation
F;1-23/Domain: signal sequence #status predicted <SIG>
F;24-278/Domain: propeptide #status predicted <PRO>
F;244-246/Region: cell attachment (R-G-D) motif
F;279-390/Product: transforming growth factor beta-1 #status predicted <MAT>
F;82,136,176/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 87.0%; Score 1840; DB 1; Length 390;

Best Local Similarity 86.9%; Pred. No. 6.6e-140;

Matches 346; Conservative 15; Mismatches 29; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllllllllll MIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

lllllhlllllllllllllllllllllMII :||IMIIIIIIIIIIIIIIh II 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

hi I Ihll lllhHIIIIII MIMIIII III llllllllllllhIM 

Db 121 YEKTKDISHSIYMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

M INI hhlllllllllMIIIII : : hill III I Mill Mlllll : 
Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

Mill III IIIMIMIIIIMIIIIIMIMM IIIIIMMIMII 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIIMIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIMIIIIIIIMI 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MIIMMIIIMIIIIMIIMIIIIIIIIIIIIIII 

Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 8 

S10219 

transforming growth factor beta-1 precursor - rat 
N;Alternate names: TGF type 2; TGF-beta 



C;Species: Rattus norvegicus (Norway rat)
C;Date: 12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change 24-Nov-1999
C;Accession: S10219; PT0023; S02267
R;Qian, S.W.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B.
Nucleic Acids Res. 18, 3059, 1990
A;Title: cDNA cloning by PCR of rat transforming growth factor beta-1.
A;Reference number: S10219; MUID:90272425; PMID:2349108
A;Accession: S10219
A;Molecule type: mRNA
A;Residues: 1-390 <QIA>
A;Cross-references: EMBL:X52498; NID:g57341; PIDN:CAA36741.1; PID:g57342
R;Okada, F.; Yamaguchi, K.; Ichihara, A.; Nakamura, T.
J. Biochem. 106, 304-310, 1989
A;Title: Purification and structural analysis of a latent form of transforming
growth factor-beta from rat platelets.
A;Reference number: PT0023; MUID:90036779; PMID:2478527
A;Accession: PT0023
A;Molecule type: protein
A;Residues: 30-32,'X',34-38,'Q',40-42,'X',44 <OKA>
R;Okada, F.; Yamaguchi, K.; Ichihara, A.; Nakamura, T.
FEBS Lett. 242, 240-244, 1989
A;Title: One of two subunits of masking protein in latent TGF-beta is a part of
pro-TGF-beta.
A;Reference number: S02267; MUID:89121078; PMID:2914605
A;Accession: S02267
A;Molecule type: protein
A;Residues: 30-32,'X',34-38,'Q',40-42,'X',44 <OK2>
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor; integrin binding
F;1-29/Domain: signal sequence #status predicted <SIG>
F;30-278/Domain: propeptide #status experimental <PRO>
F;244-246/Region: cell attachment (R-G-D) motif
F;279-390/Product: transforming growth factor beta-1 #status predicted <MAT>
F;82,136,176/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 87.0%; Score 1840; DB 2; Length 390;

Best Local Similarity 86.9%; Pred. No. 6.6e-140;

Matches 346; Conservative 14; Mismatches 30; Indels 8; Gaps 1;

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllllllllll IIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

lllllhillMIIIIIIIIIIIIIMIIIII :||IIIIIIIIIIIIIIIMh II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

Mi I Ihll lll|::|lllll| IIMIIIM I I llllllllllllhlll 
Db 121 YDKTKDITHSIYMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

II INI hhlllllllMIMIIII : : hill III I Mill lllllll : 

Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

IMII III IIMIIIIIIIIIIIIIIIIIIIIII IIIIIIIMIIIII 



Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292



Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398



Db 




RESULT 9 
A40057 

transforming growth factor beta-1 precursor - bovine (fragment) 

N;Alternate names: beta-TGF; cartilage-inducing factor-A; EGF-dependent TGF or
dEGF; MGF-b; milk growth factor b; TGF-type II
C;Species: Bos primigenius taurus (cattle)
C;Date: 28-Feb-1992 #sequence_revision 28-Feb-1992 #text_change 16-Jul-1999
C;Accession: A40057; A42320; A05284; A24322; B61439

R;Van Obberghen-Schilling, E.; Kondaiah, P.; Ludwig, R.L.; Sporn, M.B.; Baker,
C.C.
Mol. Endocrinol. 1, 693-698, 1987
A;Title: Complementary deoxyribonucleic acid cloning of bovine transforming
growth factor-beta1.
A;Reference number: A40057; MUID:91042552; PMID:3153459
A;Accession: A40057
A;Molecule type: mRNA
A;Residues: 1-315 <VAN>
A;Cross-references: GB:M36271; NID:g163747; PIDN:AAA30778.1; PID:g163748
R;Ogawa, Y.; Schmidt, D.K.; Dasch, J.R.; Chang, R.J.; Glaser, C.B.
J. Biol. Chem. 267, 2325-2328, 1992
A;Title: Purification and characterization of transforming growth factor-beta2.3
and -beta1.2 heterodimers from bovine bone.
A;Reference number: A42320; MUID:92129307; PMID:1733936
A;Accession: A42320
A;Molecule type: protein
A;Residues: 204-209,'X',211-217 <OGA>

R;Roberts, A.B.; Anzano, M.A.; Meyers, C.A.; Wideman, J.; Blacher, R.; Pan,
Y.C.E.; Stein, S.; Lehrman, S.R.; Smith, J.M.; Lamb, L.C.; Sporn, M.B.
Biochemistry 22, 5692-5698, 1983
A;Title: Purification and properties of a type beta transforming growth factor
from bovine kidney.
A;Reference number: A05284; MUID:84104793; PMID:6607069
A;Accession: A05284
A;Molecule type: protein
A;Residues: 204-218 <ROB>
R;Seyedin, S.M.; Thompson, A.Y.; Bentz, H.; Rosen, D.M.; McPherson, J.M.; Conti,
A.; Siegel, N.R.; Galluppi, G.R.; Piez, K.A.
J. Biol. Chem. 261, 5693-5695, 1986
A;Title: Cartilage-inducing factor-A. Apparent identity to transforming growth
factor-beta.
A;Reference number: A24322; MUID:86195954; PMID:3754555
A;Accession: A24322
A;Molecule type: protein
A;Residues: 204-233 <SEY>
R;Jin, Y.; Cox, D.A.; Knecht, R.; Raschdorf, F.; Cerletti, N.
J. Protein Chem. 10, 565-575, 1991
A;Title: Separation, purification, and sequence identification of TGF-beta1 and
TGF-beta2 from bovine milk.
A;Reference number: A61439; MUID:92189724; PMID:1799413
A;Accession: B61439
A;Molecule type: protein
A;Residues: 204-209,'X',211-217,'XX',220-232 <JIN>

C;Comment: This polypeptide is composed of two polypeptide chains cross-linked
by disulfide bonds. It has been found in neoplastic and non-neoplastic tissues.
C;Comment: Type II TGF does not bind to the EGF receptor and lacks intrinsic
mitogenic activity, but in soft agar, it reacts synergistically with either type
I TGF or EGF, and induces cell proliferation. Cells grown in monolayer do not
respond in a similar manner to these growth factors, but morphologically do
acquire a transformed phenotype.
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor; heterodimer
F;204-315/Product: transforming growth factor beta-1 #status experimental <MAT>
F;7,61,101/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 74.6%; Score 1577; DB 2; Length 315; 

Best Local Similarity 91.3%; Pred. No. 6.3e-119; 

Matches 295; Conservative 9; Mismatches 11; Indels 8; Gaps 1; 

Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

hlllllllllllllll I IIIMIIIIIIIIIIIIIII Ihllll I : Ihll I 
Db 1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

MM IIIIIIMIIIIIhUlllllllllllllllllllhllllMIIIIIIIIIIII 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120 

Qy 196 LSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255

Illllllllllllilll IMIIIIII I lllllll hlllhllillllllllllll 
Db 121 LSFDVTGVVRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180

Qy 256 PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLG 315 

IIIIIIIIIMIIIIIIIIIIII IIIIMIIIIIIIIIIIIIIIIIIIIIII 
Db 181 PFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKNCCVRQLYIDFRKDLG 232

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 375 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 
Db 233 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 292 

Qy 376 YYVGRKPKVEQLSNMIVRSCKCS 398 

MIMIIIIIIIMIIIIIIIII 

Db 293 YYVGRKPKVEQLSNMIVRSCKCS 315 



RESULT 10 
A41918 

transforming growth factor beta-4 precursor - chicken (fragment)
N;Alternate names: TGF-beta 4
C;Species: Gallus gallus (chicken)

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 16-Jul-1999 

C;Accession: A41918; A34941; S03110 

R;Burt, D.W.; Jakowlew, S.B. 

Mol. Endocrinol. 6, 989-992, 1992 



A;Title: Correction: a new interpretation of a chicken transforming growth
factor-beta 4 complementary DNA.
A;Reference number: A41918; MUID:92357039; PMID:1353860
A;Accession: A41918
A;Molecule type: mRNA
A;Residues: 1-373 <BUR>
A;Cross-references: GB:M31160; GB:X08012; GB:S41706; NID:g1262437;
PIDN:AAB05637.1; PID:g1262438
A;Note: Sequence extracted from NCBI backbone (NCBIN:110186, NCBIP:110187)
A;Note: this report corrects and reinterprets the sequence from reference A34941
R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A.B.
Mol. Endocrinol. 2, 1186-1195, 1988
A;Title: Complementary deoxyribonucleic acid cloning of a messenger ribonucleic
acid encoding transforming growth factor beta-4 from chicken embryo
chondrocytes.
A;Reference number: A34941; MUID:89112198; PMID:2464131
A;Accession: A34941
A;Molecule type: mRNA
A;Residues: 'MDPMSIGPKSCGGSPWRPPGTAPWSIGSRRATASSSCSTSSRVRAEVGGRAL',122-
209,'D',211-373 <JAK>
A;Cross-references: EMBL:X08012
A;Note: this sequence has been corrected in A41918
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor
F;1/Domain: signal sequence (fragment) #status predicted <SIG>
F;223-225/Region: cell attachment (R-G-D) motif
F;260-373/Product: transforming growth factor beta-4 #status predicted <MAT>
F;54,109,153/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 50.1%; Score 1059.5; DB 2; Length 373;

Best Local Similarity 54.5%; Pred. No. 2.7e-77;

Matches 208; Conservative 52; Mismatches 99; Indels 23; Gaps 8;

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

lllh :|:| hllllhllllllMII :|| : II llh I lllllh: : 
Db 2 LSTCQRLDLEAAKKKRIEAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES-VEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : I |: :|:i||: |: | : : : :: ||:: :|| I I I 
Db 62 QRARLRPPPDGPDEYWAKELRRIPMETTWDGAMEHWQPQSHSIFFVFNVSRARRG-GRPT 120 

Qy 149 LLSRAELRLLRLKLK-----VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 203

II lillhll I II :MII I I IIMI I : : llllllll 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 18 0 

Qy 204 VRQWLTRREAIEGFRLSAHSSSD---SKDNTLHVEINGFNSGRRGDLATIHGMNR--PFL 258 

I llh I : hll I : : : III Mlh :| :| I- 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMQSIAKKHRRVPYV 239

Qy 259 LLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCF--SSTEKNCCVRQLYIDFRKDLGW 316 

I II I III llhl III Ilhlll ^ lllllll llllllill I 

Db 240 LAMALPAERANELHSARRRRD--------LDTDYCFGPGTDEKNCCVRPLYIDFRKDLQW 291

Qy 317 KWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVY 376 

llllllill Ml hi I Ml III MlhlMMMMMIMIIMMM h I II IM 

Db 292 KWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPIIY 351



Qy 377 YVGRKPKVEQLSNMIVRSCKCS 398 

MM MMMMMhMM 

Db 352 YVGRNVRVEQLSNMVVRACKCS 373



RESULT 11 

A34939 

transforming growth factor beta-3 precursor - chicken 
C;Species: Gallus gallus (chicken)
C;Date: 13-Jul-1990 #sequence_revision 13-Jul-1990 #text_change 16-Jul-1999
C;Accession: A34939; S25850; S36125; S36124; I51181

R;Jakowlew, S.B.; Dillard, P.J.; Kondaiah, P.; Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 2, 747-755, 1988 

A;Title: Complementary deoxyribonucleic acid cloning of a novel transforming
growth factor-beta messenger ribonucleic acid from chick embryo chondrocytes.
A;Reference number: A34939; MUID:89096966; PMID:3211158
A;Accession: A34939
A;Status: preliminary; not compared with conceptual translation
A;Molecule type: mRNA
A;Residues: 1-412 <JAK>
A;Cross-references: GB:M31154; NID:g212758; PIDN:AAA49089.1; PID:g212759
R;Burt, D.W.; Paton, I.R.; Dey, B.R.
J. Mol. Endocrinol. 1, 175-183, 1991
A;Title: Comparative analysis of human and chicken transforming growth factor-
beta-2 and -beta-3 promoters.
A;Reference number: S25850; MUID:92134496; PMID:1840616
A;Accession: S25850
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-117 <BUR>
A;Cross-references: EMBL:X58127; NID:g63815; PIDN:CAA41128.1; PID:g63816
A;Accession: S36125
A;Status: preliminary; nucleic acid sequence not shown; translation not shown
A;Molecule type: DNA
A;Residues: 119-172 <BU2>
A;Cross-references: EMBL:X60055; NID:g396688; PIDN:CAA42653.1; PID:g396689
A;Note: the nucleotide sequence was submitted to the EMBL Data Library, June
1991
A;Accession: S36124
A;Status: preliminary; nucleic acid sequence not shown; translation not shown
A;Molecule type: DNA
A;Residues: 173-322,'ELPT',327-412 <BU3>
A;Cross-references: EMBL:X60091
A;Note: the nucleotide sequence was submitted to the EMBL Data Library, June
1991

R;Jakowlew, S.B.; Lechleider, R.; Geiser, A.G.; Kim, S.J.; Santa-Coloma, T.A.;
Cubert, J.; Sporn, M.B.; Roberts, A.B.
Mol. Endocrinol. 6, 1285-1298, 1992
A;Title: Identification and characterization of the chicken transforming growth
factor-beta 3 promoter.
A;Reference number: I51181; MUID:93024487; PMID:1406706
A;Accession: I51181
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-117 <JA2>
A;Cross-references: GB:S46000; NID:g257172; PIDN:AAB23575.1; PID:g257173
C;Genetics:
A;Introns: 216/1; 252/1; 309/2; 360/3
A;Note: list of introns may be incomplete
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen
F;1-22/Domain: signal sequence #status predicted <SIG>
F;23-300/Domain: propeptide #status predicted <PRO>
F;301-412/Product: transforming growth factor beta-3 #status predicted <MAT>
F;74,142/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 41.4%; Score 876; DB 2; Length 412;

Best Local Similarity 47.1%; Pred. No. 1.7e-62;

Matches 198; Conservative 56; Mismatches 114; Indels 52; Gaps 14;

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :|: : Ihl hhl : h I h I I I I I I I I I I I I I III I I H 

Db 9 LVLLSLLSFATVSIALSSCTTLDLEHIKKKRVEAIRGQILSKLRLTSPPE--SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE SGNQIYDKF 124 

:|llllllh : I :| I I h:|lllh : I- 1- 

Db 67 YQILALYNSTRELL--EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGLPEHNELGICP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL----KLKVEQHVELYQKYSND--- 177

II III I I III hlh : II Uhl I 

Db 125 KGVTSNVFR-FNVS---SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRIELFQILRPDEHI 180 

Qy 178 -SWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH SSSDSKD 229

MM I : MIIIIMII IhM Ih I Ml : I : 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLEISIHCPCHTFQPNGDILE 240

Qy 230 N---TLHVEINGFNSG RRGDLATI HGMNRPFLLLMATPLERAQH--LHSSRHRR 278

I I - I M MM : :: IhM I I : 1 I M 

Db 241 NLHEVLEIKFKGIDSEDDYGRGDLGRLKKQKDLHNPHLILMMLPPHRLESPTLGGQRKKR 300 

Qy 279 DYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 338 

llllllll ^ IMIIII MIMIMMIMMIIIII MM llllh 

Db 301 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYFANFCSGPCPYLR 352 

Qy 339 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

I II M II III II IIIMIIIII MM IMIIII IIIIIIIIIMMIIII 

Db 353 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS 412



RESULT 12 
B61036 

transforming growth factor beta-5 precursor - African clawed frog
C;Species: Xenopus laevis (African clawed frog)
C;Date: 31-Dec-1993 #sequence_revision 03-Feb-1994 #text_change 16-Jul-1999
C;Accession: A34929; B61036
R;Kondaiah, P.; Sands, M.J.; Smith, J.M.; Fields, A.; Roberts, A.B.; Sporn,
M.B.; Melton, D.A.
J. Biol. Chem. 265, 1089-1093, 1990
A;Title: Identification of a novel transforming growth factor-beta (TGF-beta5)
mRNA in Xenopus laevis.
A;Reference number: A34929; MUID:90110090; PMID:2295601
A;Accession: A34929
A;Molecule type: mRNA
A;Residues: 1-382 <KON>
A;Cross-references: GB:J05180; NID:g214821; PIDN:AAA49968.1; PID:g214822
R;Roberts, A.B.; Rosa, F.; Roche, N.S.; Coligan, J.E.; Garfield, M.; Rebbert,
M.L.; Kondaiah, P.; Danielpour, D.; Kehrl, J.H.; Wahl, S.M.; Dawid, I.B.; Sporn,
M.B.
Growth Factors 2, 135-147, 1990
A;Title: Isolation and characterization of TGF-beta2 and TGF-beta5 from medium
conditioned by Xenopus XTC cells.
A;Reference number: A61036; MUID:90253806; PMID:2340184
A;Accession: B61036
A;Molecule type: protein
A;Residues: 271-276,'X',278-284,'XX',287-299 <ROB>
C;Superfamily: inhibin
C;Keywords: growth factor
F;271-382/Product: transforming growth factor beta-5 #status experimental <MAT>

Query Match 41.3%; Score 873; DB 2; Length 382;
Best Local Similarity 46.9%; Pred. No. 2.6e-62;
Matches 190; Conservative 55; Mismatches 122; Indels 38; Gaps 11;

Qy 9 LPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDV 68 

: :| II MM I Mill MM h:MIIMllllllllM I I 

Db 1 MEVLWMLLVLLVLHLSSLAMSLSTCKAVDMEEVRKRRIEAIRGQILSKLKLDKTPDV-DS 59 

Qy 69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 123 

M : MM! : : :: I : MIMM I 11-1 

Db 60 EKMTVPSEAIFLYNSTLEVIREKATREEEHVGHDQNIQDYYAKQVYR---FESITELED- 115 

Qy 124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL--KVEQHVELYQKYSNDSW-- 179 

I II I Ml I II IIIM : : :M MM II : 

Db 116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 170

Qy 180 -RYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH---SSSDSKDNTLHVE 235 

III : I IIMMII I Ml I I I I I : Ml 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 INGFNSGRRGDLATIHGM--NRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYC 293 

III Mill:: :|:|:,. : I II : ||| :| : || 

Db 227 IEGF-PALRGDLASLSSKENTKPYLMITSMPAERIDTVTSSRKKR GVGQEYC 277

Qy 294 FSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQ 353 

I : Illh MhlMMIMIMMMI Ihlll IIMIhllllllllMIII 

Db 278 FGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYSKVLSLYNQ 337 

Qy 354 HNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

:|||ll :|llll IIIMhlllll IIIIIIIIMIII II 
Db 338 NNPGASISPCCVPDVLEPLPIIYYVGRTAKVEQLSNMVVRSCNCS 382



RESULT 13 

S01825 

transforming growth factor beta-3 precursor - pig
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 30-Sep-1989 #sequence_revision 30-Sep-1989 #text_change 16-Jul-1999
C;Accession: S01825
R;Derynck, R.; Lindquist, P.B.; Lee, A.; Wen, D.; Tamm, J.; Graycar, J.L.; Rhee,
L.; Mason, A.J.; Miller, D.A.; Coffey, R.J.; Moses, H.L.; Chen, E.Y.
EMBO J. 7, 3737-3743, 1988
A;Title: A new type of transforming growth factor-beta, TGF-beta3.
A;Reference number: S01824; MUID:89091120; PMID:3208746
A;Accession: S01825
A;Molecule type: mRNA
A;Residues: 1-409 <DER>
A;Cross-references: EMBL:X14150; NID:g2127; PIDN:CAA32363.1; PID:g2128
C;Superfamily: inhibin
C;Keywords: growth factor
F;1-25/Domain: signal sequence #status predicted <SIG>
F;26-297/Domain: propeptide #status predicted <PRO>
F;298-409/Product: transforming growth factor beta-3 #status predicted <MAT>

Query Match 39.7%; Score 838.5; DB 2; Length 409;
Best Local Similarity 45.6%; Pred. No. 1.7e-59;
Matches 190; Conservative 58; Mismatches 122; Indels 47; Gaps 14;

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :| : :||| hi : : I I I I : I I I I I I I I I I I I I IN : :{ 

Db 7 LVVLALLNFATVSLSMSTCTTLDFDHIKRKRVEAIRGQILSKLRLTSPPDPSML--ANIP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- -PEPEPEADYYAKEVTRVLMV ESGNQIYDKFKG 126 

II mill : I M : : |::||ll|: : I : I I : I I 

Db 65 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEIYKFDMIQGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

-III:: I III hlh : II Uhl I 

Db 125 ITSKIFR-FNVSSVEK---NETNLFRAEFRVLRMPNPSSKRSEQRIELFQILQPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH SSSDSKDN- 230

l|: : I : llllllll Ihll llh I HI : I :| 

Db 181 QRYIDGKNLPTRGAAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENI 240 

Qy 231 --TLHVEINGFNS---GRRGDLATIHGM--NRPFLLLMATPLERAQH--LHSSRHRRDYK 281 

: :: I U MM : : I Mil I :| : I : I :| 
Db 241 QEVMEIKFKGVDSEDDPGRGDLGRLKKKKEHSPHLILMMIPPDRLDNPGLGAQRKKR 297 

Qy 282 DDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 341 

IMIMII : hlllll lllllhlllllhlllMhIIII Mill: I I 
Db 298 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSAD 352 

Qy 342 TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

I :| II III II llhllllll MM hlllll lllllllhhillll 
Db 353 TTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMVVKSCKCS 409



RESULT 14 
A41397 

transforming growth factor beta-3 precursor - mouse
C;Species: Mus musculus (house mouse)
C;Date: 03-Apr-1992 #sequence_revision 03-Apr-1992 #text_change 16-Jul-1999
C;Accession: A41397; A61039; A61225
R;Miller, D.A.; Lee, A.; Matsui, Y.; Chen, E.Y.; Moses, H.L.; Derynck, R.
Mol. Endocrinol. 3, 1926-1934, 1989
A;Title: Complementary DNA cloning of the murine transforming growth
factor-beta3 (TGFbeta3) precursor and the comparative expression of TGFbeta3 and
TGFbeta1 messenger RNA in murine embryos and adult tissues.
A;Reference number: A41397; MUID:90190650; PMID:2628730
A;Accession: A41397
A;Molecule type: mRNA
A;Residues: 1-410 <MIL>
A;Cross-references: GB:M32745; NID:g201949; PIDN:AAA40422.1; PID:g201950
R;Denhez, F.; Lafyatis, R.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B.
Growth Factors 3, 139-146, 1990
A;Title: Cloning by polymerase chain reaction of a new mouse TGF-beta,
mTGF-beta3.
A;Reference number: A61039; MUID:91000714; PMID:2206556
A;Accession: A61039
A;Molecule type: mRNA
A;Residues: 1-410 <DEN>
R;Watrin, F.; Scotto, L.; Assoian, R.K.; Wolgemuth, D.J.
Cell Growth Differ. 2, 77-83, 1991
A;Title: Cell lineage specificity of expression of the murine transforming
growth factor beta-3 and transforming growth factor beta-1 genes.
A;Reference number: A61225; MUID:91299576; PMID:2069871
A;Accession: A61225
A;Status: translation not shown
A;Molecule type: mRNA
A;Residues: 285-410 <WAT>
C;Superfamily: inhibin
C;Keywords: glycoprotein; growth factor; growth regulation
F;1-21/Domain: signal sequence #status predicted <SIG>
F;22-298/Domain: propeptide #status predicted <PRO>
F;259-261/Region: cell attachment (R-G-D) motif
F;299-410/Product: transforming growth factor beta-3 #status predicted <MAT>
F;72,133,140/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 39.5%; Score 835; DB 2; Length 410;
Best Local Similarity 45.2%; Pred. No. 3.3e-59;
Matches 188; Conservative 58; Mismatches 126; Indels 44; Gaps 13;

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74

h I H : MM hi MMhMMMMMMI Ml : M 

Db 7 LVVLALLNLATISLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT--HVP 64

Qy 75 EAVLALYNSTR---DRVAGESVE--PEPEPEADYYAKEVTRVLMVE SGNQIYDKFKG 126 

MMMMI : : M I : l-MMh : i- I- II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

M I : : I Ml hlh : II Mhl I 

Db 125 ITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRIELFQILRPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH SSSDSKDN- 230

Ih : I : IMIIMI IhM MM I :| I : I M 

Db 181 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 240 

Qy 231 --TLHVEINGFNS---GRRGDLATIHGM---NRPFLLLMATPLERAQHLHSSRHRRDYKD 282 

: - I MM : : I MM I I h 
Db 241 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGSQRK 296 

Qy 283 DDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 342

MMMMI : IMMM M M II M 1 1 II I M I M I h II M MUM I II 

Db 297 --KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADT 354 



Qy 




Db 



RESULT 15 

A55706 

transforming growth factor beta-3 precursor - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 03-Mar-1995 #sequence_revision 03-Mar-1995 #text_change 16-Jul-1999
C;Accession: A55706; B40699; S36042
R;Wang, J.; Kuliszewski, M.; Yee, W.; Sedlackova, L.; Xu, J.; Tseu, I.; Post, M.
J. Biol. Chem. 270, 2722-2728, 1995
A;Title: Cloning and expression of glucocorticoid-induced genes in fetal rat
lung fibroblasts. Transforming growth factor-beta-3.
A;Reference number: A55706; MUID:95155340; PMID:7852342
A;Accession: A55706
A;Molecule type: mRNA
A;Residues: 1-410 <WAN>
A;Cross-references: GB:U03491
A;Note: it is uncertain whether Met-1 is the initiator
R;McKinnon, R.D.; Piras, G.; Ida Jr., J.A.; Dubois-Dalcq, M.
J. Cell Biol. 121, 1397-1407, 1993
A;Title: A role for TGF-beta in oligodendrocyte differentiation.
A;Reference number: A40699; MUID:93286190; PMID:8509457
A;Accession: B40699
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 157-211 <MCK>
A;Cross-references: EMBL:X71903; NID:g311326; PIDN:CAA50722.1; PID:g311327
C;Superfamily: inhibin

Query Match 39.5%; Score 834; DB 2; Length 410;
Best Local Similarity 45.2%; Pred. No. 3.9e-59;
Matches 188; Conservative 58; Mismatches 126; Indels 44; Gaps 13;
Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74

I • I • I • I I I I [ • I • I • 1 I • I I I I M I I I I I I I I 1 I • -I

Db 7 LVVLALLNLATVSLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT--HVP

Qy 75 EAVLALYNSTR---DRVAGESVE--PEPEPEADYYAKEVTRVLMVE---SGNQIYDKFKG 126

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 1

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178

-III:: I III hlh : II :||:| I

Db 125 ITSKVFR-FNVSSVEK---NGTNLFRAEFRVLRVPNPSSKRTEQRIELFQILRPDEHIAK 180

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH SSSDSKDN- 230

Ih : I : MIMIII Ihll III: I HI : I :|

Db 181 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 240

Qy 231 --TLHVEINGFNS---GRRGDLATIHGM---NRPFLLLMATPLERAQHLHSSRHRRDYKD 282

: I :: MM : : I MM I I M

Db 241 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGGQRK 296

Qy 283 DDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 342

MIIIIMI • hlllll IIMMHIIIIhlllllhllll Mill: I II

Db 297 --KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSSDT 354

Qy 343 QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

''\ II III II llhlMIII MM hlMM llllllllhhillll 

Db 355 THSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS 410



Search completed: October 28, 2003, 09:09:47 
Job time : 15.1517 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:          October 28, 2003, 00:08:41 ; Search time 7.97725 Seconds
                 (without alignments)
                 2346.251 Million cell updates/sec

Title:           US-10-017-372E-9
Perfect score:   2114
Sequence:        1 MAPSGLRLLPLLLPLLWLLV..........GRKPKVEQLSNMIVRSCKCS 398

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        127863 seqs, 47026705 residues

Total number of hits satisfying chosen parameters:    127863

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       SwissProt_41:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
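
The run header above reports a Smith-Waterman ("sw") protein-protein search with the BLOSUM62 table, Gapop 10.0 and Gapext 0.5. The following is only a minimal sketch of affine-gap local alignment scoring, not the GenCore implementation. It makes two loudly stated assumptions: a toy match/mismatch function (`+5`/`-4`, hypothetical) stands in for BLOSUM62, and a gap of length k is assumed to cost `gap_open + (k - 1) * gap_extend`; the report does not say which gap convention the program uses.

```python
def smith_waterman_affine(a, b, score=lambda x, y: 5 if x == y else -4,
                          gap_open=10.0, gap_extend=0.5):
    """Return the best local alignment score of a and b with affine gaps.

    Assumption: a gap of length k costs gap_open + (k - 1) * gap_extend.
    The default `score` is a toy substitute for a real matrix such as BLOSUM62.
    """
    neg_inf = float("-inf")
    m = len(b)
    h_prev = [0.0] * (m + 1)      # H: best score of an alignment ending at (i-1, j)
    f_prev = [neg_inf] * (m + 1)  # F: best score ending in a vertical gap
    best = 0.0
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (m + 1)
        f_cur = [neg_inf] * (m + 1)
        e = neg_inf               # E: best score ending in a horizontal gap
        for j in range(1, m + 1):
            # Open a new gap from H, or extend the gap that is already open.
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_extend)
            diag = h_prev[j - 1] + score(a[i - 1], b[j - 1])
            # Local alignment: scores never drop below zero.
            h_cur[j] = max(0.0, diag, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


if __name__ == "__main__":
    # Eight matching residues around a single one-residue gap.
    print(smith_waterman_affine("ACDEFGHIK", "ACDEGHIK"))
```

Only the score is tracked here; recovering the aligned Qy/Db rows shown under ALIGNMENTS would additionally require a traceback through the three matrices.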



SUMMARIES 



Result           Query
   No.    Score  Match  Length  DB  ID           Description

     1     2035   96.3     390   1  TGF1_PIG     P07200 sus scrofa
     2     1934   91.5     390   1  TGF1_SHEEP   P50414 ovis aries
     3     1921   90.9     390   1  TGF1_HUMAN   P01137 homo sapien
     4     1919   90.8     390   1  TGF1_CERAE   P09533 cercopithec
     5     1898   89.8     390   1  TGF1_CANFA   P54831 canis famil
     6     1844   87.2     390   1  TGF1_HORSE   O19011 equus cabal
     7     1840   87.0     390   1  TGF1_MOUSE   P04202 mus musculu
     8     1840   87.0     390   1  TGF1_RAT     P17246 rattus norv
     9     1831   86.6     390   1  TGF1_CAVPO   Q9Z1Y6 cavia porce
    10     1577   74.6     315   1  TGF1_BOVIN   P18341 bos taurus
    11   1059.5   50.1     373   1  TGF1_CHICK   P09531 gallus gall
    12      873   41.3     382   1  TGF1_XENLA   P16176 xenopus lae
    13      857   40.5     412   1  TGF3_CHICK   P16047 gallus gall
    14    838.5   39.7     409   1  TGF3_PIG     P15203 sus scrofa
    15      835   39.5     410   1  TGF3_MOUSE   P17125 mus musculu
    16      834   39.5     412   1  TGF3_RAT     Q07258 rattus norv
    17      832   39.4     412   1  TGF3_HUMAN   P10600 homo sapien
    18    831.5   39.3     414   1  TGF2_MOUSE   P27090 mus musculu
    19    816.5   38.6     414   1  TGF2_HUMAN   P08112 homo sapien
    20      816   38.6     413   1  TGF2_XENLA   P17247 xenopus lae
    21    815.5   38.6     412   1  TGF2_CHICK   P30371 gallus gall
    22    812.5   38.4     435   1  TGF2_PIG     P09858 sus scrofa
    23    812.5   38.4     442   1  TGF2_RAT     Q07257 rattus norv
    24      482   22.8     112   1  TGF2_BOVIN   P21214 bos taurus
    25      339   16.0     375   1  GDF8_MELGA   O42221 meleagris g
    26      337   15.9     375   1  GDF8_CHICK   O42220 gallus gall
    27      327   15.5     375   1  GDF8_PIG     O18831 sus scrofa
    28      326   15.4     375   1  GDF8_PAPHA   O18828 papio hamad
    29      324   15.3     376   1  GDF8_RAT     O35312 rattus norv
    30      323   15.3     375   1  GDF8_HUMAN   O14793 homo sapien
    31      322   15.2     376   1  GDF8_MOUSE   O08689 mus musculu
    32      321   15.2     375   1  GDF8_SHEEP   O18830 ovis aries
    33      312   14.8     375   1  GDF8_BOVIN   O18836 bos taurus
    34    303.5   14.4     405   1  GDFB_MOUSE   Q9Z1W4 mus musculu
    35    303.5   14.4     407   1  GDFB_HUMAN   O95390 homo sapien
    36      291   13.8     374   1  GDF8_BRARE   O42222 brachydanio
    38    281.5   13.3     425   1  IHBA_SHEEP   P43032 ovis aries
    39      278   13.2     424   1  IHBA_PIG     P03970 sus scrofa
    40      277   13.1     424   1  IHBA_MOUSE   Q04998 mus musculu
    41      277   13.1     424   1  IHBA_RAT     P18331 rattus norv
    42    276.5   13.1     425   1  IHBA_BOVIN   P07995 bos taurus
    43      276   13.1     426   1  IHBA_HORSE   P55102 equus cabal
    44      275   13.0     426   1  IHBA_HUMAN   P08476 homo sapien
    45      270   12.8     424   1  IHBA_CHICK   P27092 gallus gall
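
The "Query Match" column in the summaries appears to be each result's score expressed as a percentage of the "Perfect score" (2114) given in the run header; the report itself does not state the formula, so this is an inference from the printed numbers. A minimal sketch, with a hypothetical function name, that reproduces a few of the listed values:

```python
PERFECT_SCORE = 2114  # "Perfect score" from the run header of this search


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score, rounded to one decimal.

    Assumption: this is how the report's "Query Match" column is derived.
    """
    return round(100.0 * score / perfect_score, 1)


if __name__ == "__main__":
    # (ID, Score) pairs copied from the summaries.
    for entry_id, score in [("TGF1_PIG", 2035), ("TGF1_CHICK", 1059.5), ("IHBA_CHICK", 270)]:
        print(entry_id, query_match_percent(score))
```

The three values printed agree with the 96.3, 50.1 and 12.8 shown in the Match column for those entries.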



ALIGNMENTS 



RESULT 1 

TGF1_PIG 

ID TGF1_PIG STANDARD; PRT; 390 AA.
AC P07200; P08832;
DT 01-APR-1988 (Rel. 07, Created)
DT 01-APR-1988 (Rel. 07, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Transforming growth factor beta 1 precursor (TGF-beta 1).
GN TGFB1.
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Ovary;
RX MEDLINE=87174844; PubMed=3470708;
RA Derynck R., Rhee L.;
RT "Sequence of the porcine transforming growth factor-beta precursor.";
RL Nucleic Acids Res. 15:3187-3187(1987).
RN [2]
RP SEQUENCE FROM N.A., AND VARIANT VAL-114.
RC STRAIN=Miniature swine;
RX MEDLINE=89054010; PubMed=2461367;
RA Kondaiah P., van Obberghen-Schilling E., Ludwig R.L., Dhar R.,
RA Sporn M.B., Roberts A.B.;
RT "cDNA cloning of porcine transforming growth factor-beta 1 mRNAs.
RT Evidence for alternate splicing and polyadenylation.";
RL J. Biol. Chem. 263:18313-18317(1988).
RN [3]
RP SEQUENCE FROM N.A., AND VARIANT VAL-114.
RX MEDLINE=88335639; PubMed=3166520;
RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.;
RT "Nucleotide sequence of chicken transforming growth factor-beta 1
RT (TGF-beta 1).";
RL Nucleic Acids Res. 16:8730-8730(1988).
RN [4]
RP SHOWS THAT REF. 3 SEQUENCE IS FROM PIG.
RA Jakowlew S.B.;
RL Unpublished observations (MAR-1996).
RN [5]
RP SEQUENCE FROM N.A., AND VARIANT VAL-114.
RA Wimmers K., Chomdej S., Ponsuksili S., Schellander K.;
RT "Polymorphism in the porcine transforming growth factor beta 1
RT gene.";
RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP SEQUENCE OF 279-322.
RX MEDLINE=87102890; PubMed=2879635;
RA Cheifetz S., Weatherbee J.A., Tsang M.L.S., Anderson J.K., Mole J.E.,
RA Lucas R., Massague J.;
RT "The transforming growth factor-beta system, a complex pattern of
RT cross-reactive ligands and receptors.";
RL Cell 48:409-415(1987).

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC -!- CAUTION: REF. 3 SEQUENCE WHICH WAS SAID TO ORIGINATE FROM CHICKEN 
CC WHITE LEGHORN, SEEMS (REF. 4) TO ORIGINATE FROM PIG. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
DR EMBL; Y00111; CAA68291.1;
DR EMBL; M23703; AAA64616.1;
DR EMBL; X12373; CAA30933.1;
DR EMBL; AF461808; AAL57902.1;
DR PIR; A27512; A27512.
DR HSSP; P01137; 1KLA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.
KW Growth factor; Mitogen; Glycoprotein; Signal; Polymorphism.
FT SIGNAL 1 23 POTENTIAL.
FT PROPEP 24 278
FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.
FT DISULFID 285 294 BY SIMILARITY.
FT DISULFID 293 356 BY SIMILARITY.
FT DISULFID 322 387 BY SIMILARITY.
FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).
FT VARIANT 114 114 L -> V.
FT CONFLICT 6 7 LR -> PG (IN REF. 3).
FT CONFLICT 180 180 R -> G (IN REF. 3).
FT CONFLICT 237 237 N -> NA (IN REF. 3).
SQ SEQUENCE 390 AA; 44294 MW; A6E2C3659FC384E6 CRC64;



Query Match 96.3%; Score 2035; DB 1; Length 390; 

Best Local Similarity 97.0%; Pred. No. 9e-154; 

Matches 386; Conservative 1; Mismatches 3; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIMMIIIIMMMIMIIIIIIIIIMIIIIIIMIIIIIIIIIMIMIIM 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

MIIIIIMIIIIIIIIIIIIIIMIIIMIIIMIMIMIIIIIIMIIIhllMII 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

IIMIMIMIIIIMIIIIIIIIIIIIIMIIIIIIIMIIIIMIMIIIIMMIM 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

IIMIIIIMIMIMIIIIIIIIMMIIMIIMIIIIII I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

IIIMIMIIMIIMIIIIIMIMMIMMMIM IMIIIIIMMII

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIIIIIIIIIIIMIMIMIMIIIIIIIIII 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390
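
The per-result statistics line (Matches; Conservative; Mismatches; Indels) appears to determine "Best Local Similarity" as the share of identical columns among all alignment columns. This is inferred from the printed numbers, not documented in the report, and the function name below is hypothetical. A minimal sketch:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical columns as a percentage of all alignment columns.

    Assumption: alignment length = matches + conservative + mismatches + indels,
    which is consistent with the values printed in this report.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # RESULT 1 (TGF1_PIG): Matches 386; Conservative 1; Mismatches 3; Indels 8.
    print(best_local_similarity(386, 1, 3, 8))
```

For RESULT 1 the four counts sum to 398 columns, the length of the query, and the computed value agrees with the 97.0% printed above.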

RESULT 2 
TGF1_SHEEP 

ID TGF1_SHEEP STANDARD; PRT; 390 AA.
AC P50414;
DT 01-OCT-1996 (Rel. 34, Created)
DT 01-OCT-1996 (Rel. 34, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Transforming growth factor beta 1 precursor (TGF-beta 1).
GN TGFB1.
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
OC Bovidae; Caprinae; Ovis.
OX NCBI_TaxID=9940;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95121932; PubMed=7821809;
RA Woodall C.J., McLaren L.J., Watt N.J.;
RT "Sequence and chromosomal localisation of the gene encoding ovine
RT latent transforming growth factor-beta 1.";
RL Gene 150:371-373(1994).
RN [2]
RP SEQUENCE OF 281-390 FROM N.A.
RC STRAIN=Merino; TISSUE=Skin;
RX MEDLINE=95268698; PubMed=7749621;
RA Sutton R., Ward W.G., Raphael K.A., Cam G.R.;
RT "Growth factor expression in skin during wool follicle development.";
RL Comp. Biochem. Physiol. 110B:697-705(1995).

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; X76916; CAA54242.1;
DR EMBL; L36038; AAA31526.1; -.
DR PIR; I46463; I46463.
DR HSSP; P01137; 1KLA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.
KW Growth factor; Mitogen; Glycoprotein; Signal.
FT SIGNAL 1 23 POTENTIAL.
FT PROPEP 24 278 POTENTIAL.
FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.
FT DISULFID 285 294 BY SIMILARITY.
FT DISULFID 293 356 BY SIMILARITY.
FT DISULFID 322 387 BY SIMILARITY.
FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).
SQ SEQUENCE 390 AA; 44291 MW; 1C247299484D0E57 CRC64;


Query Match 91.5%; Score 1934; DB 1; Length 390;
Best Local Similarity 91.7%; Pred. No. 9e-146;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60

I IIIIIIIIIIIIMIMUIIIII IIIIIIMIIIMIMI IIIIIIIIIMIIII

Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTIDMELVKRKGIEAIRGQILSKLRLA 60

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

IIIIMIIIIIIIIIhllllMMIIIMM I IIIIIMIIIIIIIIMIII Ihl

Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

III I : Ihll llllllllllllllllMhUIIIIIMIIIIIIIIIIIIhlll

Db 121 YDKMKSSSHSIYMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

IIIMIIIIIIIIIMIIIMIIIIIIII II lllllllll I IMIIII hlllh

Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

IIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIII Illlllllllllli

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292



Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

MIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 3 
TGF1_HUMAN 

ID TGF1_HUMAN STANDARD; PRT; 390 AA.
AC P01137; Q9UCG4;
DT 21-JUL-1986 (Rel. 01, Created)
DT 01-FEB-1991 (Rel. 11, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Transforming growth factor beta 1 precursor (TGF-beta 1).
GN TGFB1 OR TGFB.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87174845; PubMed=3470709;
RA Derynck R., Rhee L., Chen E.Y., van Tilburg A.;
RT "Intron-exon structure of the human transforming growth factor-beta
RT precursor gene.";
RL Nucleic Acids Res. 15:3188-3189(1987).
RN [2]
RP SEQUENCE FROM N.A.
RX MEDLINE=85296301; PubMed=3861940;
RA Derynck R., Jarrett J.A., Chen E.Y., Eaton D.H., Bell J.R.,

RA Assoian R.K., Roberts A.B., Sporn M.B., Goeddel D.V.; 

RT "Human transforming growth factor-beta complementary DNA sequence and 

RT expression in normal and transformed cells."; 

RL Nature 316:701-705(1985). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Duodenum, and Eye; 

RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 279-390 FROM N.A. 

RC TISSUE=Carcinoma;
RA Urushizaki Y., Niitsu Y., Terui T., Koshida Y., Mahara K., Kohgo Y.,
RA Urushizaki I., Takahashi Y., Ito H.;
RT "Cloning and expression of the gene for human transforming growth
RT factor-beta in Escherichia coli.";
RL Tumor Res. 22:41-55(1987).
RN [5]
RP SEQUENCE OF 279-329.
RC TISSUE=Bladder carcinoma;
RX MEDLINE=93229900; PubMed=8471846;
RA Bourdrel L., Lin C.-H., Lauren S.L., Elmore R.H., Sugarman B.J.,
RA Hu S., Westcott K.R.;
RT "Recombinant human transforming growth factor-beta 1: expression by
RT Chinese hamster ovary cells, isolation, and characterization.";
RL Protein Expr. Purif. 4:130-140(1993).
RN [6]
RP SEQUENCE OF 279-301.
RX MEDLINE=85131019; PubMed=2982829;
RA Massague J., Like B.;

RT "Cellular receptors for type beta transforming growth factor. Ligand 

RT binding and affinity labeling in human and rodent cell lines."; 

RL J. Biol. Chem. 260:2636-2645(1985). 

RN [7] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144319; PubMed=8424 942 ; 

RA Archer S.J., Bax A., Roberts A.B., Sporn M.B., Ogawa Y. , Piez K.A. , 

RA Weatherbee J.A. , Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: NMR signal assignments of the 

RT recombinant protein expressed and isotopically enriched using Chinese 

RT hamster ovary cells."; 

RL Biochemistry 32:1152-1163(1993). 

RN [8] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144320; PubMed=8424 943 ; 

RA Archer S.J., Bax A., Roberts A.B. , Sporn M.B., Ogawa Y. , Piez K.A. , 

RA Weatherbee J.A. , Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J., 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: secondary structure as determined 

RT by heteronuclear magnetic resonance spectroscopy."; 

RL Biochemistry 32:1164-1171(1993). 

RN [9] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=96266150; PubMed=8679613;

RA Hinck A.P., Archer S.J., Qian S.W., Roberts A.B., Sporn M.B.,

RA Weatherbee J.A., Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J.,

RA Torchia D.A.;

RT "Transforming growth factor beta 1: three-dimensional structure in 

RT solution and comparison with the X-ray structure of transforming 

RT growth factor beta 2.";



RL Biochemistry 35:8517-8534(1996). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.
CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X05839; CAA29283.1; -. 

DR EMBL; X05840; CAA29283.1; JOINED. 

DR EMBL; X05843; CAA29283.1; JOINED. 

DR EMBL; X05844; CAA29283.1; JOINED. 

DR EMBL; X05849; CAA29283.1; JOINED. 

DR EMBL; X05850; CAA29283.1; JOINED. 

DR EMBL; X02812; CAA26580.1; ALT_SEQ. 

DR EMBL; BC001180; AAH01180.1; -.
DR EMBL; BC000125; AAH00125.1; -.
DR EMBL; BC022242; AAH22242.1; -.
DR EMBL; M38449; AAA36735.1; -.
DR PIR; A27513; WFHU2.
DR PDB; 1KLA; 17-AUG-96.
DR PDB; 1KLC; 17-AUG-96.
DR PDB; 1KLD; 17-AUG-96.
DR Genew; HGNC:11766; TGFB1.
DR MIM; 190180; -.
DR GO; GO:0006916; P:anti-apoptosis; TAS.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.
KW Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure.

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 

FT DISULFID 293 356 

FT DISULFID 322 387 

FT DISULFID 326 389 

FT DISULFID 355 355 INTERCHAIN. 



FT 
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ft? 
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.) (POTENTIAL) 


FT 
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.) (POTENTIAL) 


FT 
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N-LINKED (GLCNAC- . 


.) (POTENTIAL) 


FT 


S ITE 
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FT 
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FT 


TURN 302 305






FT STRAND 313 313
FT STRAND 317 317
FT STRAND 321 323
FT HELIX 335 346
FT TURN 348 349
FT STRAND 358 370
FT TURN 371 372
FT STRAND 373 387
SQ SEQUENCE 390 AA; 44341 MW; 75391614250288FE CRC64;


Query Match 90.9%; Score 1921; DB 1; Length 390;
Best Local Similarity 91.7%; Pred. No. 9.6e-145;
Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1;



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
I lllllll llllllllllllllllllllllllllllllllllllllllllllllllll
Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRIA 60

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
lllllhlllllllllllllllllllllllll lllllllllllllllllllllh hi
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
Mill ' Ihll MIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMhIII
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
IIIIIIIIIIIIIIIIIIIIIMIIMhl lllllllll I Ihllll hllll
Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
UIIMIIIIIIIIIIIIIIIIIIIIIIIII IMIM llllllllllllll
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIM
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 4 
TGF1_CERAE

ID TGF1_CERAE STANDARD; PRT; 390 AA.



AC P09533; 

DT 01-MAR-1989 (Rel. 10, Created)
DT 01-MAR-1989 (Rel. 10, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Transforming growth factor beta 1 precursor (TGF-beta 1).
GN TGFB1.
OS Cercopithecus aethiops (Green monkey) (Grivet).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
OC Cercopithecinae; Cercopithecus.
OX NCBI_TaxID=9534;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87246074; PubMed=3474130;

RA Sharples K., Plowman G.D., Rose T.M., Twardzik D.R., Purchio A.F.;

RT "Cloning and sequence analysis of simian transforming growth 

RT factor-beta cDNA."; 

RL DNA 6:239-244(1987). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROL PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M16658; AAA35369.1; -.
DR PIR; A26960; A26960.
DR HSSP; P01137; 1KLA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 16 POTENTIAL. 

FT PROPEP 17 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 



FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).
SQ SEQUENCE 390 AA; 44356 MW; DFF63E2BAB44320E CRC64;

Query Match 90.8%; Score 1919; DB 1; Length 390;
Best Local Similarity 91.5%; Pred. No. 1.4e-144;

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIIIIIMIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIM lllllllillll 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

lllllhlllllllllllllllllMIIIIII IIIIIIIIIIIIIIIIIIIIM: hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : l|:|| IIIIIIIIIIIIMIIIII II IIIIIIIIMIIIIIIIIIII: III 
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

IIMIIIMhIIIIIIIIIIIIIIIIMI IIIIIIIM I IMIIII hill!

Db 181 YLSNRLLAPSNSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

MIIMIIIIIIIIIIIIIIIMIIMIIM llllll llllllllllllll 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IMIIIIIIIIIMIIIIIIIMIMIIIIMIIIIMIIIIIIIIIIIIIIIIIMIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIMIIIIIIIIIIIIIIIIMIMIIIMIIIMI 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 
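
The percentage reported as Best Local Similarity for each hit appears to follow directly from the alignment counts printed beside it. The following is a minimal Python sketch, assuming (an inference from the figures in this printout, not from any documentation of the search program) that the value is the number of identical positions divided by the total number of alignment columns, the columns being Matches + Conservative + Mismatches + Indels. The function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are identical, rounded to 0.1.

    Assumption: the denominator is every aligned column, including
    conservative substitutions, mismatches and gap (indel) positions.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts taken from the summary line of the alignment above (RESULT 4).
print(best_local_similarity(364, 10, 16, 8))
```

Under this assumption the counts for RESULT 4 (364, 10, 16, 8) give 398 columns and 91.5%, which agrees with the figure printed for that hit.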



RESULT 5 
TGF1_CANFA 

ID TGF1_CANFA STANDARD; PRT; 390 AA.

AC P54831;

DT 01-OCT-1996 (Rel. 34, Created)

DT 01-OCT-1996 (Rel. 34, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1).

GN TGFB1.

OS Canis familiaris (Dog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
OX NCBI_TaxID=9615; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Jugular vein endothelial;

RX MEDLINE=95237630; PubMed=7721110;

RA Manning A.M., Auchampach J.A., Drong R.F., Slightom J.L.;

RT "Cloning of a canine cDNA homologous to the human transforming growth 

RT factor-beta 1-encoding gene."; 

RL Gene 155:307-308(1995). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; L34956; AAA51458.1; -. 

DR PIR; JC4023; JC4023.
DR HSSP; P01137; 1KLA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT SIGNAL 1 23 POTENTIAL.
FT PROPEP 24 278 BY SIMILARITY.
FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.
FT DISULFID 285 294 BY SIMILARITY.
FT DISULFID 293 356 BY SIMILARITY.
FT DISULFID 322 387 BY SIMILARITY.
FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN.
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).
SQ SEQUENCE 390 AA; 44185 MW; EB4780E88B7B590E CRC64;

Query Match 89.8%; Score 1898; DB 1; Length 390;



Best Local Similarity 90.7%; Pred. No. 6.4e-143; 

Matches 361; Conservative 12; Mismatches 17; Indels 8; Gaps 1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60



I IIIIMIIIIIIM IIIIIMIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIh 

Db 1 MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLS 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

lllllhlll IIIIIIIIMIIIIIIIIIII IIIMIIIIIIIIIIIIIIIIh hi 

Db 61 SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

hi I :|lhlllllMIIIIIIIIIIIIIIIIIIIIIIIM llllllllllllllll
Db 121 YEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

IIIIIIMIIhlllMIIIIIMIIIh UIIIIMI I lllllll hlllh

Db 181 YLSNRLLAPSDTPEWLSFDVTGVVRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

I IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII II IIIIMIIIIIIM

Db 241 SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIMIIMIIMMIMIIMIIIIIMIIIMIIMIMMIMIIIMIIMIIIMI 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MIIIMIIIIMIIIMIIIIIIIIIIIIIIIIIIII 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 6 
TGF1_HORSE

ID TGF1_HORSE STANDARD; PRT; 390 AA.

AC O19011;

DT 15-JUL-1998 (Rel. 36, Created)

DT 15-JUL-1998 (Rel. 36, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1).

GN TGFB1.

OS Equus caballus (Horse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Perissodactyla; Equidae; Equus.

OX NCBI_TaxID=9796;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph node; 

RX MEDLINE=98185507; PubMed=9524819 ; 

RA Penha-Goncalves M.N., Onions D.E., Nicolson L.;

RT "Cloning and sequencing of equine transforming growth factor-beta 1 

RT (TGF beta-1) cDNA."; 

RL DNA Seq. 7:375-378(1997). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted.



CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X99438; CAA67801.1; -. 
DR HSSP; P01137; IKLA. 
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.
KW Growth factor; Mitogen; Glycoprotein; Signal.
FT SIGNAL 1 23 POTENTIAL.
FT PROPEP 24 278 BY SIMILARITY.
FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.
FT DISULFID 285 294 BY SIMILARITY.
FT DISULFID 293 356 BY SIMILARITY.
FT DISULFID 322 387 BY SIMILARITY.
FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 390 AA; 43974 MW; A86D715F44549691 CRC64;

Query Match 87.2%; Score 1844; DB 1; Length 390;
Best Local Similarity 88.4%; Pred. No. 1.2e-138;
Matches 352; Conservative 12; Mismatches 26; Indels 8; Gaps 1;

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60
I IIIIIIIIIIIIIMIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRIA 60

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120
lllllhllllllllllllllllM :|IIM I llllllllllllllllllll hi
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180
I : Ihll lllllll llhhilllllllllllll llllllllllllhIM
Db 121 YKTVETGSHSIYMFFNTSELRAAVPDPMLLSRAELRLLRLKLSVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240
lllllll IIIIIIIIIIIIIIIIIMh: hMIIIIII I lllllll I lllh
Db 181 YLSNRLLTPSDSPEWLSFDVTGVVRQWLSQGGAMEGFRLSAHCSCDSKDNTLRVGINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300
I llllllll llllllllllllllllll llllllll Illlllllllllll
Db 241 SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360
IIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398
lllllll llllllllllllllllllllllllllllll
Db 353 APCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 7 

TGF1_MOUSE

ID TGF1_MOUSE STANDARD; PRT; 390 AA.

AC P04202; 

DT 20-MAR-1987 (Rel. 04, Created)

DT 20-MAR-1987 (Rel. 04, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1).

GN TGFB1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=86168129; PubMed=3007454;

RA Derynck R., Jarrett J.A., Chen E.Y., Goeddel D.V.;

RT "The murine transforming growth factor-beta precursor."; 

RL J. Biol. Chem. 261:4377-4379(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RX MEDLINE=96096545; PubMed=8522200 ; 

RA Guron C., Sudarshan C., Raghow R.;

RT "Molecular organization of the gene encoding murine transforming

RT growth factor beta 1.";

RL Gene 165:325-326(1995). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6, and NOD/LT; TISSUE=Spleen;

RA Poirot L., Benoist C., Mathis D.;

RT "Transforming growth factor-beta 1 sequence and expression: no

RT difference between NOD/Lt and C57B1/6 mouse strains.";

RL Submitted (AUG-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 



DR EMBL; M13177; AAA40423.1; -.
DR EMBL; L42462; AAB00138.1; -.
DR EMBL; L42456; AAB00138.1; JOINED.
DR EMBL; L42457; AAB00138.1; JOINED.
DR EMBL; L42458; AAB00138.1; JOINED.
DR EMBL; L42459; AAB00138.1; JOINED.
DR EMBL; L42460; AAB00138.1; JOINED.
DR EMBL; L42461; AAB00138.1; JOINED.
DR EMBL; AJ009862; CAA08900.1; -.
DR PIR; A01396; WFMS2.
DR HSSP; P01137; 1KLA.
DR MGD; MGI:98725; Tgfb1.
DR GO; GO:0005578; C:extracellular matrix; IDA.
DR GO; GO:0006954; P:inflammatory response; IMP.
DR GO; GO:0007515; P:lymph gland development; IMP.
DR GO; GO:0008220; P:necrosis; IMP.
DR GO; GO:0016202; P:regulation of myogenesis; IDA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT SIGNAL 1 23 POTENTIAL.
FT PROPEP 24 278
FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.
FT DISULFID 285 294 BY SIMILARITY.
FT DISULFID 293 356 BY SIMILARITY.
FT DISULFID 322 387 BY SIMILARITY.
FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).
SQ SEQUENCE 390 AA; 44310 MW; 4381A51B711D689E CRC64;



Query Match 87.0%; Score 1840; DB 1; Length 390; 

Best Local Similarity 86.9%; Pred. No. 2.5e-138; 

Matches 346; Conservative 15; Mismatches 29; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllllllllll IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 



Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRIA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

IIMIhllllllllllllMIIIIIIIIIII :|ll!lllllllllllllllh II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

hi I Ihll lllhHIMIII Mlllllll III llllllllllllhlll 
Db 121 YEKTKDISHSIYMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

II INI hhllllllllllllllll : : hill III I Mill lllllll :
Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

lllll III IIIIIIIIIIMIIIIIIIIIIIIII Illlllllllllll
Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 
Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

:||IIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIII 
Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 8 
TGF1_RAT 

ID TGF1_RAT STANDARD; PRT; 390 AA. 

AC P17246; 

DT 01-AUG-1990 (Rel. 15, Created)

DT 01-AUG-1990 (Rel. 15, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1).

GN TGFB1.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Heart;

RX MEDLINE=90272425; PubMed=2349108;

RA Qian S.W., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "cDNA cloning by PCR of rat transforming growth factor beta-1."; 

RL Nucleic Acids Res. 18:3059-3059(1990). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X52498; CAA36741.1; -.
DR PIR; S10219; S10219.
DR HSSP; P01137; 1KLA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR003911; TGF_TGFb.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT SIGNAL 1 23 POTENTIAL.
FT PROPEP 24 278
FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.
FT DISULFID 285 294 BY SIMILARITY.
FT DISULFID 293 356 BY SIMILARITY.
FT DISULFID 322 387 BY SIMILARITY.
FT DISULFID 326 389 BY SIMILARITY.
FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).
SQ SEQUENCE 390 AA; 44329 MW; 5E21108ED50D853C CRC64;

Query Match 87.0%; Score 1840; DB 1; Length 390;
Best Local Similarity 86.9%; Pred. No. 2.5e-138;

Matches 346; Conservative 14; Mismatches 30; Indels 8; Gaps 1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIMIIIIIilll llllllllllllillllllllllllllllMIIIIIIIIIIIII 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120

lllllhlllllllMMIIIIIIIMIIIII :|IIIIIIIMIIIIIIMIh II

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180

III I Ihll IllhUIIIIII IMIIIIII I I llllllllllllhlll
Db 121 YDKTKDITHSIYMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

II MM hIMIIIIIIIIIIIIMI : hill III I Mill IIIIIM :

Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 240



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

Mill Ml IIIIIMIIMIMIIIIIIIIIIII llllllllllllll 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

IIIIIIIIIIIIIIIIIMIMIIMIIIIIIIIIIIIMIIIIMIIIIIMIIIIIII 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MMIMIIIIIIIIMIIIMMMIIIIIIIIIIII 

Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 9 

TGF1_CAVPO

ID TGF1_CAVPO STANDARD; PRT; 390 AA.

AC Q9Z1Y6; Q9QZB3; Q9R148;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1).

GN TGFB1.

OS Cavia porcellus (Guinea pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Hystricognathi; Caviidae; Cavia.

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A.

RC STRAIN=Hartley;

RA Jeevan A., McMurray D.N., Yoshimura T.;

RT "Guinea pig transforming growth factor-beta in peritoneal exudates 

RT after BCG vaccination."; 

RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 265-382 FROM N.A. 

RX MEDLINE=99144670; PubMed=10028978;

RA Scarozza A.M., Ramsingh A.I., Wicher V., Wicher K.;

RT "Spontaneous cytokine gene expression in normal guinea pig blood and

RT tissues.";

RL Cytokine 10:851-859(1998). 

RN [3] 

RP SEQUENCE OF 279-371 FROM N.A. 

RC STRAIN=Hartley; TISSUE=Trachea;

RA Morishima Y., Uchida Y., Nomura A., Ishii Y., Sakamoto T.,

RA Sekizawa K.;

RT "Guinea-pig transforming growth factor-beta expression in injured 

RT tracheal epithelium."; 

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION,

CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked (By similarity).

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 



CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF191297; AAF02780.1; -. 

DR EMBL; AF097509; AAC83807.1; -.

DR EMBL; AF169347; AAD49347.1; -.

DR HSSP; P01137; 1KLA.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT SIGNAL 1 24 POTENTIAL.

FT PROPEP 25 278 POTENTIAL.

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1.

FT DISULFID 285 294 BY SIMILARITY.

FT DISULFID 293 356 BY SIMILARITY.

FT DISULFID 322 387 BY SIMILARITY.

FT DISULFID 326 389 BY SIMILARITY.

FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 82 82 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 136 136 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 176 176 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL).

FT CONFLICT 279 279 G -> P (IN REF. 3).

FT CONFLICT 286 286 F -> S (IN REF. 2).

FT CONFLICT 309 309 K -> E (IN REF. 2).

FT CONFLICT 322 322 C -> R (IN REF. 2).

FT CONFLICT 350 350 A -> G (IN REF. 2).

SQ SEQUENCE 390 AA; 44328 MW; 1539F849BA0C0FF1 CRC64;



Query Match 86.6%; Score 1831; DB 1; Length 390; 

Best Local Similarity 86.7%; Pred. No. 1.3e-137; 

Matches 345; Conservative 15; Mismatches 30; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 



Db 




Qy 



61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 



Db 




Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 



Db 121 YKSIETVAHSIYMFFNTSELREAVPDPLLLSRAELRMQRLKLNVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

lll|:|l llhl III III III I II III:: I :|||| Ml I lllllll Mill 
Db 181 YLSNQLLTPSDTPEWLSFDVTGVVRQWLSQGEELEGFRFSAHCSCDSKDNTLRVEINGIG 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300

llllll lllllllllllllllllllllllllllll IIIMIIIIIIII 

Db 241 PKRRGDLAAIHGMNRPFLLLMATPLERAQHLHSSRHRR--------GLDTNYCFSSTEKN 292

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

I III MM III Mill M Mill IIMII Mill Illllllllllllllllllll Mill 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIIIIIIIIIMIIII IIIIMIIIIIIIIII 
Db 353 APCCVPQALEPLPIVYYVGRKAKVEQLSNMIVRSCKCS 390 
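
The summary lines printed before each alignment appear to be related by simple arithmetic: in the results shown here, Best Local Similarity is consistent with Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). The Python sketch below checks that relationship. The function name is illustrative, and the formula is an assumption inferred from the reported figures, not taken from GenCore documentation.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches.

    Assumption: the aligned length is the sum of the four reported counts.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts copied from the summary line of the alignment above.
print(best_local_similarity(345, 15, 30, 8))
```

With the counts above the aligned length is 398 columns, which equals the query length, so this alignment spans the whole query.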

RESULT 10 
TGF1_BOVIN

ID TGF1_BOVIN STANDARD; PRT; 315 AA.

AC P18341;

DT 01-NOV-1990 (Rel. 16, Created)

DT 01-NOV-1990 (Rel. 16, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (Fragment).

GN TGFB1.

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91042552; PubMed=3153459;

RA van Obberghen-Schilling E., Kondaiah P., Ludwig R.L., Sporn M.B.,

RA Baker C.C.;

RT "Complementary deoxyribonucleic acid cloning of bovine transforming 

RT growth factor-beta 1."; 

RL Mol. Endocrinol. 1:693-698(1987). 

RN [2] 

RP SUBUNITS. 

RC TISSUE=Bone; 

RX MEDLINE=92129307; PubMed=1733936;

RA Ogawa Y., Schmidt D.K., Dasch J.R., Chang R.J., Glaser C.B.; 

RT "Purification and characterization of transforming growth factor-beta 

RT 2.3 and -beta 1.2 heterodimers from bovine bone."; 

RL J. Biol. Chem. 267:2325-2328(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked. Heterodimers of TGF-beta 1/2

CC have been found in bone.

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M36271; AAA30778.1; -.

DR PIR; A40057; A40057.

DR HSSP; P01137; 1KLA.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein.


FT NON_TER 1 1

FT PROPEP <1 203

FT CHAIN 204 315 TRANSFORMING GROWTH FACTOR BETA 1.

FT DISULFID 210 219 BY SIMILARITY.

FT DISULFID 218 281 BY SIMILARITY.

FT DISULFID 247 312 BY SIMILARITY.

FT DISULFID 251 314 BY SIMILARITY.

FT DISULFID 280 280 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 7 7 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 61 61 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 101 101 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 169 171 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 315 AA; 36269 MW; C2717A23D994E00E CRC64;


Query Match 74.6%; Score 1577; DB 1; Length 315;



Best Local Similarity 91.3%; Pred. No. 1.3e-117; 

Matches 295; Conservative 9; Mismatches 11; Indels 8; Gaps 1; 
Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

hlllllllllllllll I llllllllllllllllllll IhlMI I : Ihll I 

Db 1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

IIIIIMIMIIIIIMhHIIIIIIMIMIIIIIIIIhlMIIIIIIMIIIIIII 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120 

Qy 196 LSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255

IIIIIIIIIIIIMIM MIIIIIM I IIIIMI hlllhlllllllllllllll 

Db 121 LSFDVTGVVRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180



Qy 256 PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLG 315 

IIIIIIIIIMIIIIIIIIIIII Illllllllllllllllllllllllllll 
Db 181 PFLLLMATPLERAQHLHSSRHRR--------ALDTNYCFSSTEKNCCVRQLYIDFRKDLG 232

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 375 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIMIIIIIIII 

Db 233 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 292

Qy 376 YYVGRKPKVEQLSNMIVRSCKCS 398 

Illllllllllllllllllllll 
Db 293 YYVGRKPKVEQLSNMIVRSCKCS 315 

RESULT 11 

TGF1_CHICK 

ID TGF1_CHICK STANDARD; PRT; 373 AA.

AC P09531;

DT 01-MAR-1989 (Rel. 10, Created)

DT 01-OCT-1996 (Rel. 34, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 4)

DE (Fragment).

GN TGFB1.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=89112198; PubMed=2464131;

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a messenger 

RT ribonucleic acid encoding transforming growth factor beta 4 from 

RT chicken embryo chondrocytes."; 

RL Mol. Endocrinol. 2:1186-1195(1988). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=92357039; PubMed=1353860;

RA Burt D.W., Jakowlew S.B.;

RT "Correction: a new interpretation of a chicken transforming growth 

RT factor-beta 4 complementary DNA."; 

RL Mol. Endocrinol. 6:989-992(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its



CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M31160; AAB05637.1; 

DR PIR; A41918; A41918. 

DR HSSP; P01137; 1KLA.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT NON_TER 1 1 

FT SIGNAL <1 1 POTENTIAL. 

FT PROPEP 2 259 POTENTIAL. 

FT CHAIN 260 373 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 266 277 BY SIMILARITY. 

FT DISULFID 276 339 BY SIMILARITY. 

FT DISULFID 305 370 BY SIMILARITY. 

FT DISULFID 309 372 BY SIMILARITY. 

FT DISULFID 338 338 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 54 54 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 109 109 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 153 153 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 224 226 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 373 AA; 42634 MW; 9903F3479C8552E5 CRC64;

Query Match 50.1%; Score 1059.5; DB 1; Length 373; 

Best Local Similarity 54.5%; Pred. No. 1.5e-76; 

Matches 208; Conservative 52; Mismatches 99; Indels 23; Gaps 8; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

Illh :hl hllllhllllllllM :|l : II |lh I lllllh: ^ 
Db 2 LSTCQRLDLEAAKKKRIEAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES-VEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : I |: :|:|||: |: j : : : :: ||:: :|1 | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRIPMETTWDGAMEHWQPQSHSIFFVFNVSRARRG-GRPT 120 

Qy 149 LLSRAELRLLRLKLK-----VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 203

II llllhll I II :||||| I Mill I : : II III II I 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 180 

Qy 204 VRQWLTRREAIEGFRLSAHSSSD---SKDNTLHVEINGFNSGRRGDLATIHGMNR--PFL 258 

I llh I : hi! I : : : III Ulh \ \ I- 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMQSIAKKHRRVPYV 239 

Qy 259 LLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCF--SSTEKNCCVRQLYIDFRKDLGW 316 

I II I III llhl III Ilhlll : lllllll lllllllll I 

Db 240 LAMALPAERANELHSARRRRD--------LDTDYCFGPGTDEKNCCVRPLYIDFRKDLQW 291



Qy 317 KWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVY 376 

IIIIMIII I II hill Ml II lllhMIMIIIIIIIIIIIIIMII hi III: I 

Db 292 KWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPIIY 351 

Qy 377 YVGRKPKVEQLSNMIVRSCKCS 398 

MM Mlllllhlhllll 
Db 352 YVGRNVRVEQLSNMVVRACKCS 373



RESULT 12 
TGF1_XENLA

ID TGF1_XENLA STANDARD; PRT; 382 AA.

AC P16176;

DT 01-APR-1990 (Rel. 14, Created)

DT 01-APR-1990 (Rel. 14, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 5).

OS Xenopus laevis (African clawed frog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae;

OC Xenopodinae; Xenopus.

OX NCBI_TaxID=8355;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90110090; PubMed=2295601;

RA Kondaiah P., Sands M.J., Smith J.M., Fields A., Roberts A.B.,

RA Sporn M.B., Melton D.A.;

RT "Identification of a novel transforming growth factor-beta (TGF-beta 

RT 5) mRNA in Xenopus laevis."; 

RL J. Biol. Chem. 265:1089-1093(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Vempati U.D., Kondaiah P.;

RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: IMPORTANT ROLE IN CERTAIN ASPECTS OF DIFFERENTIATION.

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; J05180; AAA49968.1; 

DR EMBL; AF009335; AAB64441.1; -. 

DR EMBL; AF009331; AAB64441.1; JOINED. 

DR EMBL; AF009332; AAB64441.1; JOINED. 

DR EMBL; AF009333; AAB64441.1; JOINED. 

DR EMBL; AF009334; AAB64441.1; JOINED. 

DR PIR; A34929; B61036. 

DR HSSP; P01137; 1KLA.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal.

FT SIGNAL 1 21 POTENTIAL.

FT PROPEP 22 270

FT CHAIN 271 382 TRANSFORMING GROWTH FACTOR BETA 1.

FT DISULFID 277 286 BY SIMILARITY.

FT DISULFID 285 348 BY SIMILARITY.

FT DISULFID 314 379 BY SIMILARITY.

FT DISULFID 318 381 BY SIMILARITY.

FT DISULFID 347 347 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 73 73 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 123 123 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 166 166 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 234 236 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 382 AA; 44200 MW; 1034621C917AAE15 CRC64;

Query Match 41.3%; Score 873; DB 1; Length 382;

Best Local Similarity 46.9%; Pred. No. 9.2e-62;

Matches 190; Conservative 55; Mismatches 122; Indels 38; Gaps 11; 

Qy 9 LPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDV 68 

: :| II MM I Mill MM h:MIIMIIIIIIIIM I I 

Db 1 MEVLWMLLVLLVLHLSSLAMSLSTCKAVDMEEVRKRRIEAIRGQILSKLKLDKTPDV-DS 59 

Qy 69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 123 

M : Mill : : - I : Ml MM I II - I 

Db 60 EKMTVPSEAIFLYNSTLEVIREKATREEEHVGHDQNIQDYYAKQVYR FESITELED- 115 

Qy 124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL--KVEQHVELYQKYSNDSW-- 179 

I II I Ml I M lllh : : :M Mh II : 

Db 116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 170 

Qy 180 -RYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH---SSSDSKDNTLHVE 235 

III : I IIMMII I Ml I I I M : Ml 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 INGFNSGRRGDLATIHGM--NRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYC 293 

III Mill:: M:|:: : I M : Ml =1 : M 

Db 227 IEGF-PALRGDLASLSSKENTKPYLMITSMPAERIDTVTSSRKKR--------GVGQEYC 277

Qy 294 FSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQ 353 

I : lllh MhlMMMIMMMM IhIM M II II M 1 1 1 1 II 1 : 1 1 II 

Db 278 FGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYSKVLSLYNQ 337 

Qy 354 HNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

:|MM MUM MlllhlllM IMIIIIhllll M 

Db 338 NNPGASISPCCVPDVLEPLPIIYYVGRTAKVEQLSNMVVRSCNCS 382



RESULT 13 

TGF3_CHICK 

ID TGF3_CHICK STANDARD; PRT; 412 AA. 

AC P16047; 

DT 01-APR-1990 (Rel. 14, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 3 precursor (TGF-beta 3).

GN TGFB3.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89096966; PubMed=3211158;

RA Jakowlew S.B., Dillard P.J., Kondaiah P., Sporn M.B., Roberts A.B.;

RT "Complementary deoxyribonucleic acid cloning of a novel transforming

RT growth factor-beta messenger ribonucleic acid from chick embryo

RT chondrocytes.";

RL Mol. Endocrinol. 2:747-755(1988). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=95169270; PubMed=7865129;

RA Burt D.W., Dey B.R., Paton I.R., Morrice D.R., Law A.S.;

RT "The chicken transforming growth factor-beta 3 gene: genomic

RT structure, transcriptional analysis, and chromosomal location.";

RL DNA Cell Biol. 14:111-123(1995). 

RN [3] 

RP SEQUENCE OF 1-117 FROM N.A. 

RC STRAIN=White leghorn; TISSUE=Blood; 

RX MEDLINE=92134496; PubMed=1840616;

RA Burt D.W., Dey B.R., Paton I.R.;

RT "Comparative analysis of human and chicken transforming growth

RT factor-beta 2 and -beta 3 promoters.";

RL J. Mol. Endocrinol. 7:175-183(1991). 

RN [4] 

RP SEQUENCE OF 1-117 FROM N.A. 

RX MEDLINE=93024487; PubMed=1406706;

RA Jakowlew S.B., Lechleider R., Geiser A.G., Kim S.J.,

RA Santa-Coloma T.A., Cubert J., Sporn M.B., Roberts A.B.;

RT "Identification and characterization of the chicken transforming 

RT growth factor-beta 3 promoter."; 

RL Mol. Endocrinol. 6:1285-1298(1992). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION.

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to license@isb-sib.ch).

CC

DR EMBL; M31154; AAA49089.1; -.

DR EMBL; X58127; CAA41128.2;

DR EMBL; X60055; CAA41128.2; JOINED.

DR EMBL; X60091; CAA41128.2; JOINED.

DR EMBL; X60090; CAA41128.2; JOINED.

DR EMBL; S46000; AAB23575.1; -.

DR PIR; A34939; A34939.

DR HSSP; P10600; 1TGJ.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal.



FT SIGNAL 1 23 POTENTIAL.

FT PROPEP 24 300

FT CHAIN 301 412 TRANSFORMING GROWTH FACTOR BETA 3.

FT DISULFID 307 316 BY SIMILARITY.

FT DISULFID 315 378 BY SIMILARITY.

FT DISULFID 344 409 BY SIMILARITY.

FT DISULFID 348 411 BY SIMILARITY.

FT DISULFID 377 377 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 74 74 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 135 135 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 142 142 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 261 263 CELL ATTACHMENT SITE (POTENTIAL).

FT CONFLICT 323 326 ELPT -> DFRQ (IN REF. 1).

SQ SEQUENCE 412 AA; 47077 MW; 1CAB883170069D55 CRC64;



Query Match 40.5%; Score 857; DB 1; Length 412; 

Best Local Similarity 46.4%; Pred. No. 1.9e-60; 

Matches 195; Conservative 56; Mismatches 117; Indels 52; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :h : Ihl hhl :hM:|||IIIIIIIIM III II \ 

Db 9 LVLLSLLSFATVSLALSSCTTLDLEHIKKKRVEAIRGQI LSKLRLTSPPE --SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE- - -SGNQI YDKF 124 

Ullllllh : 1:111 h:||lll: : I- I- 

Db 67 YQILALYNSTRELL--EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGLPEHNELGICP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II II I I I III hlh : II Hhl I 

Db 125 KGVTSNVFR-FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRIELFQILRPDEHI 180 

Qy 178 -SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKD 22 9 

Mil I : IIIMIIIM Ihll Ih I :|| : I : 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLEISIHCPCHTFQPNGDILE 24 0 



Qy 230 N TLHVEINGFNSG---RRGDLATI---HGMNRPFLLLMATPLERAQH--LHSSRHRR 278 

I I - I :| MM ^ I MM \ \ I 1 M 

Db 241 NLHEVLEIKFKGIDSEDDYGRGDLGRLKKQKDLHNPHLILMMLPPHRLESPTLGGQRKKR 300 

Qy 279 DYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 338

MMMM ' MMMI MM MMMMMMI MM MMM 

Db 301 --------ALDTNYCFRNLEENCCVRPLYIELPTDLGWKWVHEPKGYFANFCSGPCPYLR 352

Qy 339 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

I M M M Ml M MMMMM MM MMMI IIIIIMIMMMIII 

Db 353 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS 412

RESULT 14 

TGF3_PIG 

ID TGF3_PIG STANDARD; PRT; 409 AA.

AC P15203;

DT 01-APR-1990 (Rel. 14, Created)

DT 01-APR-1990 (Rel. 14, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 3 precursor (TGF-beta 3).

GN TGFB3.

OS Sus scrofa (Pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.

OX NCBI_TaxID=9823;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=89091120; PubMed=3208746;

RA Derynck R., Lindquist B., Lee A., Wen D., Tamm J., Graycar J.L.,

RA Rhee L., Mason A.J., Miller D.A., Coffey R.J., Moses H.L.,

RA Chen E.Y.;

RT "A new type of transforming growth factor-beta, TGF-beta 3."; 

RL EMBO J. 7:3737-3743(1988). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- SIMILARITY: Belongs to the TGF-beta family.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X14150; CAA32363.1; 

DR PIR; S01825; S01825. 

DR HSSP; P10600; 1TGJ.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 18 POTENTIAL. 

FT PROPEP 19 297 

FT CHAIN 298 409 TRANSFORMING GROWTH FACTOR BETA 3. 

FT DISULFID 304 313 BY SIMILARITY. 

FT DISULFID 312 375 BY SIMILARITY. 

FT DISULFID 341 406 BY SIMILARITY. 

FT DISULFID 345 408 BY SIMILARITY. 

FT DISULFID 374 374 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 72 72 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 133 133 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 140 140 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 259 261 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 409 AA; 46814 MW; B4900235B5CC955E CRC64;

Query Match 39.7%; Score 838.5; DB 1; Length 409; 

Best Local Similarity 45.6%; Pred. No. 5.4e-59; 

Matches 190; Conservative 58; Mismatches 122; Indels 47; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h Ml : hi: HIMHIIIIIIIIIIII III : H 

Db 7 LVVLALLNFATVSLSMSTCTTLDFDHIKRKRVEAIRGQILSKLRLTSPPDPSML--ANIP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- -PEPEPEADYYAKEVTRVLMV ESGNQIYDKFKG 126 

II mill : I II : : hHMIh : |: I I : || 
Db 65 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEIYKFDMIQGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

-III:: I Ml hlh : II :|hl I 

Db 125 ITSKIFR-FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRI ELFQILQPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

l|: : I : IIIIIIII Ihll llh I :| 1 : I :| 

Db 181 QRYIDGKNLPTRGAAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENI 240 

Qy 231 --TLHVEINGFNS---GRRGDLATIHGM--NRPFLLLMATPLERAQH--LHSSRHRRDYK 281

: :: I :| MM : : IMMI I :| : I : I :| 

Db 241 QEVMEIKFKGVDSEDDPGRGDLGRLKKKKEHSPHLILMMIPPDRLDNPGLGAQRKKR--- 297 

Qy 282 DDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 341 

IIIMIII : hlllll IIMIhlllllMIIMIhllll Mill: I I 
Db 298 -----ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSAD 352

Qy 342 TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398

I :| II III II IIMIIIIII MM MIMII IIIMIM:|:|MII 
Db 353 TTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMVVKSCKCS 409



RESULT 15 

TGF3_MOUSE

ID TGF3_MOUSE STANDARD; PRT; 410 AA.

AC P17125;

DT 01-AUG-1990 (Rel. 15, Created)

DT 01-AUG-1990 (Rel. 15, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Transforming growth factor beta 3 precursor (TGF-beta 3).

GN TGFB3.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90190650; PubMed=2628730;

RA Miller D.A., Lee A., Matsui Y., Chen E.Y., Moses H.L., Derynck R.;

RT "Complementary DNA cloning of the murine transforming growth 

RT factor-beta 3 (TGF beta 3) precursor and the comparative expression 

RT of TGF beta 3 and TGF beta 1 messenger RNA in murine embryos and 

RT adult tissues."; 

RL Mol. Endocrinol. 3:1926-1934(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91000714; PubMed=2206556;

RA Denhez F., Lafyatis R., Kondaiah P., Roberts A.B., Sporn M.B.;

RT "Cloning by polymerase chain reaction of a new mouse TGF-beta,

RT mTGF-beta 3.";

RL Growth Factors 3:139-146(1990). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide-linked.

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M32745; AAA40422.1; -. 

DR PIR; A41397; A41397. 

DR HSSP; P10600; 1TGJ.

DR MGD; MGI:98727; Tgfb3.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR003911; TGF_TGFb.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 298 POTENTIAL. 

FT CHAIN 299 410 TRANSFORMING GROWTH FACTOR BETA 3. 
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o I b i M i LAK ill. 


FT DISULFID 313 376 BY SIMILARITY.

FT DISULFID 342 407 BY SIMILARITY.

FT DISULFID 346 409 BY SIMILARITY.

FT DISULFID 375 375 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 72 72 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 133 133 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 140 140 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 259 261 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 410 AA; 46884 MW; 250F7048CA432BD6 CRC64;


Query Match 39.5%; Score 835; DB 1; Length 410;

Best Local Similarity 45.2%; Pred. No. 1e-58;

Matches 188; Conservative 58; Mismatches 126; Indels 44; Gaps 13; 
Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :| : MM hi MMhMMMMIMM Ml : M 

Db 7 LVVLALLNLATISLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT--HVP 64 

Qy 75 EAVLALYNSTR---DRVAGESVE--PEPEPEADYYAKEVTRVLMVE---SGNQIYDKFKG 126 

lllllllll : : II I hMlllh : h: h: II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

III : : I Ml hlh : II Mhl I 

Db 125 ITSKVFR-FNVSSVEK---NGTNLFRAEFRVLRVPNPSSKRTEQRIELFQILRPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

Ih :| : IIIMIII Ihll III: I Ml : I M 

Db 181 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 24 0 

Qy 231 --TLHVEINGFNS---GRRGDLATIHGM---NRPFLLLMATPLERAQHLHSSRHRRDYKD 282 

I - MM : : I MM I I |: 
Db 241 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGSQRK 296 

Qy 283 DDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 342 

MIIIIMI : hIMII lllllhllMlhlMIMMIM IIMh I M 

Db 297 --KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADT 354

Qy 343 QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398

M II III II Mhllllll MM hillll IIIMIIIhhIIIII 

Db 355 THSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS 410



Search completed: October 28, 2003, 09:08:38 
Job time: 9.97725 secs

GenCore version 5.1.6
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 28, 2003, 07:50:55 ; Search time 31.909 Seconds 

(without alignments) 
3218.683 Million cell updates/sec 
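
The throughput figure above appears consistent with the usual definition of a cell update in dynamic-programming searches: one cell per (query residue, database residue) pair, divided by the search time. The sketch below is only a cross-check under that assumption; the function name is illustrative and not part of GenCore. It uses the query length (398), database size (258052604 residues), and search time (31.909 seconds) reported in this run.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Dynamic-programming cells filled per second, in millions.

    Assumes one cell per (query residue, database residue) pair.
    """
    cells = query_len * db_residues
    return cells / seconds / 1e6


# Values taken from this run's header.
rate = million_cell_updates_per_sec(398, 258_052_604, 31.909)
print(f"{rate:.3f} million cell updates/sec")
```

The result should land within rounding distance of the reported 3218.683; any small difference is expected because the search time is itself printed to only three decimals.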



Title: US-10-017-372E-9

Perfect score: 2114

Sequence: 1 MAPSGLRLLPLLLPLLWLLV..........GRKPKVEQLSNMIVRSCKCS 398



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 830525 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SPTREMBL_23:*

1: sp_archea:*
2: sp_bacteria:*
3: sp_fungi:*
4: sp_human:*
5: sp_invertebrate:*
6: sp_mammal:*
7: sp_mhc:*
8: sp_organelle:*
9: sp_phage:*
10: sp_plant:*
11: sp_rodent:*
12: sp_virus:*
13: sp_vertebrate:*
14: sp_unclassified:*
15: sp_rvirus:*
16: sp_bacteriap:*
17: sp_archeap:*

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
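
The percentage fields printed with each result in this listing can be cross-checked from the raw counts. The sketch below assumes two relationships that are inferred from the numbers in this report rather than taken from GenCore documentation: Query Match is the hit score as a percentage of the perfect score (2114 for this query), and Best Local Similarity is the number of identical positions as a percentage of all alignment columns (matches, conservative substitutions, mismatches, and indels). Function names are illustrative.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect (self) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches: int, conservative: int,
                                  mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts reported for RESULT 1 (Q9TUM8) in this listing.
print(round(query_match_percent(1820, 2114), 1))
print(round(best_local_similarity_percent(348, 12, 30, 8), 1))
```

Recomputing these values is a practical way to catch scanning errors in the counts, since a misread digit in Score or Matches will no longer reproduce the printed percentage.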
ALIGNMENTS



RESULT 1
Q9TUM8

ID Q9TUM8 PRELIMINARY; PRT; 390 AA.

AC Q9TUM8;

DT 01-MAY-2000 (TrEMBLrel. 13, Created)

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor beta 1.

GN TGFB1.

OS Equus caballus (Horse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Perissodactyla; Equidae; Equus.

OX NCBI_TaxID=9796;

RN [1]

RP SEQUENCE FROM N.A.

RA Nixon A.J., Brower-Toland B.T., Sandell L.J.;

RT "Molecular cloning of equine transforming growth factor beta 1 reveals

RT a divergent nucleotide structure that encodes a novel bioactive

RT peptide among mammalian species.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF175709; AAD49431.1; -.

DR HSSP; P01137; 1KLA.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

SQ SEQUENCE 390 AA; 43860 MW; 220FE40DFCCA6016 CRC64;

Query Match 86.1%; Score 1820; DB 6; Length 390;

Best Local Similarity 87.4%; Pred. No. 6.5e-154;

Matches 348; Conservative 12; Mismatches 30; Indels 8; Gaps 1;

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIM 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

MlllhlllllllllllllMIII :|llll I IIIIIIIIIIMMIIIIII hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

I : Ihll II Mil llhhilllllllllllll llllllllllllhlll 
Db 121 YKTVETGSHSIYMFFNASELRAAVPDPMLLSRAELRLLRLKLSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240

lllllllllllllllllllllllllll:: hi Ml III I II I II I I III h 
Db 181 YLSNRLLTPSDSPEWLSFDVTGVVRQWLSQGGAMEGLRLSAHCPCDSKDNTLRVGINGFS 240

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKN 300 

I IMIMII IIIIIIIIIIMIIIIII llllllll lllllll llllll 

Db 241 SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRR ALDTNYCSSSTEKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

MM MM II lllllll III! II III III II II II I III III I II III I III I II II III 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

lllllll MIIIIIIIIIIIIIIIIMIMIMIIII 

Db 353 APCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390



RESULT 2 
Q8R4D9 

ID Q8R4D9 PRELIMINARY; PRT; 368 AA. 

AC Q8R4D9; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created)

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor beta-1 protein (Fragment).

GN TGFB1.

OS Sigmodon hispidus (Hispid cotton rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Sigmodontinae;

OC Sigmodon.

OX NCBI_TaxID=42415;

RN [1]

RP SEQUENCE FROM N.A.

RA Blanco J.C., Pletneva L.M., Prince G.A.;

RT "Cotton rat cytokines, chemokines, and interferons."; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF480858; AAL87199.1; 

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 368 AA; 41905 MW; A5C91207B0468B4A CRC64; 

Query Match 82.4%; Score 1741; DB 11; Length 368; 

Best Local Similarity 86.4%; Pred. No. 6.7e-147; 

Matches 325; Conservative 16; Mismatches 27; Indels 8; Gaps 1; 
Qy 23 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYN 82 

IIIIIIIIIIMIIIMIIIIIMIMIIIIIIIIIIIIIIIIhlllllllllllllll 

Db 1 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYN 60 

Qy 83 STRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELRE 142 

Illlilllll :||lllllllllllllllllh I MM I llhll lllhMI 
Db 61 STRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAIYDKTKDIPHSVYMFFNTSDIRE 120

Qy 143 AVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTG 202

MM! lllllllll I I lllllllhlllhlllll lllhhIIIIIIIIIM 

Db 121 AVPEPPLLSRAELRLQRFKSNVEQHVELYEKYSNNSWRYLGNRLLSPTDSPEWLSFDVTS 180 

Qy 203 VVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMA 262

llhll : : hill III I Mill IIIMM : lllll Ml llllllllll 
Db 181 VVRKWLNQGDGIQGFRFSAHCSCDSKDNILHVEINGISPKRRGDLGTIHDMNRPFLLLMA 240

Qy 263 TPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEP 322 

II I M M II I II II II 1 1 II II I II 1 1 II II II M II I II M II M M II 1 1 

Db 241 TPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEP 292 

Qy 323 KGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKP 382 

IIIMIMMIIMIMIMMIIIIIMIIIIIIMhMIIMIMIIMIMillll 

Db 293 KGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASASPCCVPQALEPLPIVYYVGRKP 352

Qy 383 KVEQLSNMIVRSCKCS 398 

Illlllllllllllll 



Db 353 KVEQLSNMIVRSCKCS 368 
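
Because the match lines printed between the Qy and Db rows are easily damaged in scanned copies, the identities in an alignment block can be recounted directly from the two sequence rows. The sketch below is a minimal illustration, assuming both rows are supplied as equal-length aligned strings with `-` marking gaps; the function name is hypothetical. It counts only identical residues and does not attempt to classify conservative substitutions, which would require the scoring table.

```python
def count_identities(qy: str, db: str) -> int:
    """Count alignment columns where both rows carry the same residue.

    Gap characters ('-') never count as identities.
    """
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for a, b in zip(qy, db) if a == b and a != "-")


# Final block of RESULT 2 (Qy 383-398 against Db 353-368).
print(count_identities("KVEQLSNMIVRSCKCS", "KVEQLSNMIVRSCKCS"))
```

Summing this count over every block of a result should reproduce the Matches figure in that result's summary line, provided the sequence rows themselves were scanned correctly.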



RESULT 3 
Q8JHF5 



ID Q8JHF5 PRELIMINARY; PRT; 379 AA. 

AC Q8JHF5; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor beta 1.

OS Sparus aurata (Gilthead sea bream).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;

OC Acanthomorpha; Acanthopterygii; Percomorpha; Perciformes; Percoidei;

OC Sparidae; Sparus.

OX NCBI_TaxID=8175;

RN [1]

RP SEQUENCE FROM N.A.

RA Tafalla C., Aranguren R., Secombes C.J., Castrillo J.L., Novoa B.,

RA Figueras A.;

RT "Molecular characterization of sea bream (Sparus aurata) transforming

RT growth factor beta1.";

RL Fish and Shellfish Immunol. 0:0-0(2002). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF510084; AAN03842.1; 

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 2.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

SQ SEQUENCE 379 AA; 43506 MW; C0C9D3D2FCA29C0E CRC64;

Query Match 40.1%; Score 848; DB 13; Length 379;

Best Local Similarity 46.5%; Pred. No. 3.5e-67; 

Matches 186; Conservative 68; Mismatches 110; Indels 36; Gaps 15; 

Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL-ASPPSQGDVPP 70

h h::: I h : I : I I I I I I : h I h I I I I I I I I I I I I I I I I II 

Db 3 LVFLMFMVAYTVGK-VSGMSTCKTLDLEMVKKKRIEAIRSQILSKLRLPTESPQAGD--E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDKFK 125 



. I . . I . I I I I I . . . . I . . I . I . I III • I • 
Db 60 EEIPSSLLSLYNSTKEMLKEQQTEVQTDIFTEQEEEEYFA TRVHKFNTTNPV 111 

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

II - I M Ihl : Ih IMhl U II lllll MM 

Db 112 RTPQNMSMSFNISEIRRSIGDYRLLTTAELRML-IKAPTILDEQRVELYQGLGT-SPRYL 169 

Qy 183 SNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGF--RLSAHSSSDSKDNTLHVEINGFN 240 

: = l = HIIIIII :: || = - I II I =1 hi 



Db 170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 229



Qy 241 SGRRGDLATIHGMNR--PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTE 298

:MIM : \ 1-1 hi : II :M :| : I I : li 

Db 230 AG-RGDKAVLDDMTKQPPYILTMSIPKNVSSHL-TSRKKRSTETKDT CTAQTE 280

Qy 299 KNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGA 358

III! Illlllllllllllhl lllhhl I III: : :||::|||| Mill 
Db 281 -TCCVRSLYIDFRKDLGWKWIHKPTRYHANYCMGSCTYIWNAENKYSQILALYKHHNPGA 339 

Qy 359 SAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398

II IIIMMIIIIIhllllh IIIIIMIIhlllll 

Db 340 SAQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 379 



RESULT 4 
Q8AXK8 



ID Q8AXK8 PRELIMINARY; PRT; 379 AA. 

AC Q8AXK8;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;

OC Acanthomorpha; Acanthopterygii; Percomorpha; Perciformes; Percoidei;

OC Sparidae; Sparus.

OX NCBI_TaxID=8175;

RN [1]

RP SEQUENCE FROM N.A.

RA Tafalla C., Novoa B., Aranguren R., Figueras A.;

RT "Molecular cloning and characterization of sea bream (Sparus aurata)

RT TGF beta 1.";

RL Submitted (SEP-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF424703; AAN76665.1; 

SQ SEQUENCE 379 AA; 43485 MW; A306EC387F6DBA7C CRC64;

Query Match 39.9%; Score 844; DB 13; Length 379;

Best Local Similarity 46.5%; Pred. No. 8.1e-67;

Matches 186; Conservative 67; Mismatches 111; Indels 36; Gaps 15; 
Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL-ASPPSQGDVPP 70 

h I-- I |: nnillhhhIhIIIMII llllllll I II 

Db 3 LVFLMFMVAYTVGK-VSGMSTCKTLDLEMVKKKRIEAIRSQILSKLRLPTESPQAGD--E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVESGNQI YDKFK 125 

:| -hlllll- : : I : : | :|:| ||| : | : 
Db 60 EEIPSSLLSLYNSTKEMLKEQQTEVQTDIFTEXEEEEYFA TRVHKFNTTNPV 111

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

I II Ihl : Ih lllhl H II Mill I III 

Db 112 RTPQNMSMSFNISEIRRSIGDYRLLTTAELRML-IKAPTILDEQRVELYQGLGT-SPRYL 169 

Qy 183 SNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGF--RLSAHSSSDSKDNTLHVEINGFN 240 

-I : :|llllll II : - I II I :| hi 

Db 170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 229 



Qy 241 SGRRGDLATIHGMNR--PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTE 298

:MII I : I : l-l |: I : M :| : I I : I I 

Db 230 AG-RGDKAVLDDMTKQPPYILTMSIPKNVSSHL-TSRKKRSTETKDT CTAQTE 280 

Qy 299 KNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGA 358

MM MMMMMMMhl MMMM I Mh ^ Mh MM MMI 

Db 281 -TCCVRSLYIDFRKDLGWKWIHKPTRYHANYCMGSCTYIWNAENKYSQXLALYKHHNPGA 339

Qy 359 SAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398

M MMMIMMMMMM: M I M M M MM M M 

Db 340 SAQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 379



RESULT 5 
Q91YU7 

ID Q91YU7 PRELIMINARY; PRT; 412 AA. 

AC Q91YU7; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor, beta 3.

GN TGFB3.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R.; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC014690; AAH14690.1; -. 

DR MGD; MGI:98727; Tgfb3.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

SQ SEQUENCE 412 AA; 47144 MW; F3EB65D046DF32AD CRC64;

Query Match 39.5%; Score 835; DB 11; Length 412; 

Best Local Similarity 45.2%; Pred. No. 5.8e-66; 

Matches 188; Conservative 58; Mismatches 126; Indels 44; Gaps 13; 
Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

M I M : MM hi MMhMMMMMMI Ml : M 

Db 9 LVVLALLNLATISLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT--HVP 66 

Qy 75 EAVLALYNSTR---DRVAGESVE--PEPEPEADYYAKEVTRVLMVE---SGNQIYDKFKG 126 

lllllllll : : II I : hMlllh : j:: j:: || 



Db 67 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 126



Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

-III:: I III hlh : M :||:| I 

Db 127 ITSKVFR-FNVSSVEK---NGTNLFRAEFRVLRVPNPSSKRTEQRIELFQILRPDEHIAK 182 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAH SSSDSKDN- 230

l|: : I : IIMIMI Ihll llh I :|l : I :| 

Db 183 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 242 

Qy 231 --TLHVEINGFNS---GRRGDLATIHGM---NRPFLLLMATPLERAQHLHSSRHRRDYKD 282

: :: I :: ill! : : I hll I I h 
Db 243 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGSQRK 298

Qy 283 DDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 342 

:||||MM : hlllll lllllhlllllhlllllhllll llllh I II 
Db 299 --KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADT 356

Qy 343 QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

:| II III II llhllllll II III: llllllll III II hh II III 
Db 357 THSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS 412



RESULT 6 
Q91VP5 

ID Q91VP5 PRELIMINARY; PRT; 414 AA. 

AC Q91VP5; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Similar to transforming growth factor, beta 2.

GN TGFB2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Breast tumor;

RA Strausberg R.; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC011170; AAH11170.1; 

DR MGD; MGI:98726; Tgfb2.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

SQ SEQUENCE 414 AA; 47588 MW; DB37A7C38881F286 CRC64;



Query Match 39.2%; Score 828.5; DB 11; Length 414; 

Best Local Similarity 44.2%; Pred. No. 2.2e-65; 

Matches 188; Conservative 63; Mismatches 121; Indels 53; Gaps 15; 



Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVP-P 70 

H Mill MM hlh IIIIMMIIMIIIM Ml III 

Db 5 VLSTFLLLHLVP--VALSLSTCSTLDMDQFMRKRIEAIRGQILSKLKLTSPPE--DYPEP 60 

Qy 71 GPLPEAVLALYNSTRD RVAGESVEPEPE- PEADYYAKEVTRVLM- - -VESGNQI YD 122 

M h-MMM : : : II : MM Ml - I \ \ \ 

Db 61 DEVPPEVISIYNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHLPSENAIPP 120 

Qy 123 KFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKV-EQHVELYQ K 173 

I h' : h I : : I Ml h II I M M Mill 

Db 121 TFY-RPYFRIVRFDVSTMEKNASN---LVKAEFRVFRLQNPKARVAEQRIELYQILKSKD 176 

Qy 174 YSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSS 225

:: : ||: :::: MIMMI hMI - IhM I 

Db 177 LTSPTQRYIDSKWKTRAEGEWLSFDVTDAVQEWLHHKDRNLGFKISLHCPCCTFVPSNN 236 

Qy 226 DSKDNTLHVE INGFNSGRRGDLATIHGMNR PFLLLMATPLERAQHLHS 273 

M I hi Mil: I MM I I : I 

Db 237 YIIPNKSEELEARFAGIDGTSTYASGDQKTIKSTRKKTSGKTPHLLLMLLPSYRLESQQS 296 

Qy 274 SRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGP 333 

II MM III : : Mhl I I II h M II II II M II II M II I I 

Db 297 SRRKK RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGA 349

Qy 334 CPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVR 393 

Ilhll llhMMMM M MIMMI I MM IMIM IIMIIIMIh 

Db 350 CPYLWSSDTQHTKVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKIEQLSNMIVK 409 

Qy 394 SCKCS 398

Mill 

Db 410 SCKCS 414 



RESULT 7 

Q9PWA9 

ID Q9PWA9 PRELIMINARY; PRT; 382 AA. 

AC Q9PWA9; 

DT Ol-MAY-2000 (TrEMBLrel . 13, Created) 

DT Ol-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA. 

OS Morone chrysops x Morone saxatilis (white bass x striped bass) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; 

OC Acanthomorpha; Acanthopterygii; Percomorpha; Perciformes; Percoidei;

OC Moronidae; Morone.

OX NCBI_TaxID=45352;

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=KIDNEY; 

RX MEDLINE=20394636; PubMed=10938723;

RA Harms C.A., Kennedy-Stoskopf S., Horne W.A., Fuller F.J.,

RA Tompkins W.A.F.;

RT "Cloning and sequencing hybrid striped bass (Morone saxatilis x M.

RT chrysops) transforming growth factor-beta (TGF-beta), and development

RT of a reverse transcription quantitative competitive polymerase chain 

RT reaction (RT-qcPCR) assay to measure TGF-beta mRNA of teleost fish."; 

RL Fish Shellfish Immunol. 10:61-85(2000). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: HIGHER LEVELS FOUND IN MONONUCLEAR CELLS FROM 

CC PERIPHERAL BLOOD THAN IN SPLEEN OR ANTERIOR KIDNEY. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY.

DR EMBL; AF140363; AAD46997.1;

DR HSSP; P01137; 1KLA.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT SIGNAL 1 ? POTENTIAL.

FT PROPEP 9 270

FT CHAIN 271 382 TRANSFORMING GROWTH FACTOR BETA.

FT DISULFID 278 286 BY SIMILARITY.

FT DISULFID 285 348 BY SIMILARITY.

FT DISULFID 314 379 BY SIMILARITY.

FT DISULFID 318 381 BY SIMILARITY.

FT DISULFID 347 347 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 73 73 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 108 108 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 113 113 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 124 124 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 259 259 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 235 237 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 382 AA; 43846 MW; 3124D8C34EA74D72 CRC64;

Query Match 39.0%; Score 824.5; DB 13; Length 382;

Best Local Similarity 46.1%; Pred. No. 4.5e-65;

Matches 184; Conservative 64; Mismatches 114; Indels 37; Gaps 14;



Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL ASPPSQGDVPPG 71

1^ :H I I IIIIMII | || 

Db 6 LMLVVVYTVGN-VSGMSTCKTLDLEMVKKKRIEAIRSQILSKLRLPKEPEPDQAGDEEEI 64

Qy 72 PLPEAVLALYNSTRDRVAGESVEPE-- PEPEADYYAKEVTRVLMVESGNQIYDKFKG 126

I I :hllllh: : : I : 

Db 65 PTP--LLSLYNSTKEMLKEQQTEVQTDISTEQEEEEYFAKVLHKFNMTRKNN 114

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV--EQHVELYQKYSNDSWRYLSN 184

I ^ I II Ihlhl : Ih lllhl I : II INI II Mh: 

Db 115 TDTTKKMFFNISEIRESVGDYRLLTSAELRMLIKKTTIYDEQRVELYSGL-GDSPRYLAS 173

Qy 185 RLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGF--RLSAHSSSDSKDNTL-HVEINGFNS 241

I : II : Ul II : I h hi : 

Db 174 RFITNKWKDKWLSFDVTKTLQDWLKGTDDEQGFQLRLFCECNKVSAGETIFKFGISGIDP 233 

Qy 242 GRRGDLATIHGMNR--PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEK 299 

I III • • h:| hi I- I I : : I : I : II 

Db 234 G-RGDTGPMQLLTQQPPYILTMSIP QNISSPSTSRKKRSTETK DVCTAQTE- 283 

Qy 300 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359 

MM IIIIIIIMIIMIhl Ihlh-hll llh : :|h:|lll I Mill 

Db 284 TCCVRSLYIDFRKDLGWKWIHKPTGYNANYCMGSCTYIWNAENKYSQILALYKHHNPGAS 343 

Qy 360 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

I llllllllllllhllllh IMIIIIIIhlllll 

Db 344 AQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 382 

RESULT 8 

O93449

ID O93449 PRELIMINARY; PRT; 382 AA.

AC O93449; Q91217;

DT 01-NOV-1998 (TrEMBLrel. 08, Created)

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor beta precursor.

GN TGF-BETA OR TGF.

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei;

OC Protacanthopterygii; Salmoniformes; Salmonidae; Oncorhynchus.

OX NCBI_TaxID=8022;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=LEUKOCYTE;

RX MEDLINE=99242020; PubMed=10227481;

RA Daniels G.D., Secombes C.J.;

RT "Genomic organisation of rainbow trout, Oncorhynchus mykiss TGF- 

RT BETA."; 

RL Dev. Comp. Immunol. 23:139-147(1999). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=LEUKOCYTE;

RX MEDLINE=98390168; PubMed=9722928;

RA Hardie L.J., Laing K.J., Daniels G.D., Grabowski P.S., Cunningham C.,

RA Secombes C.J.; 

RT "Isolation of the first piscine transforming growth factor beta gene: 

RT analysis reveals tissue specific expression and a potential regulatory 

RT sequence in rainbow trout (Oncorhynchus mykiss)."; 

RL Cytokine 10:555-563(1998). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED (BY SIMILARITY).

CC -!- TISSUE SPECIFICITY: EXPRESSED IN BLOOD LEUKOCYTES, KIDNEY 

CC MACROPHAGES, BRAIN, GILL AND SPLEEN BUT NOT IN LIVER. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY.

DR EMBL; AJ007836; CAA07707.1; 



DR EMBL; X99303; CAA67685.1; -. 

DR HSSP; P01137; 1KLA.

DR InterPro; IPR002400; GF_cysknot.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR00438; GFCYSKNOT.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT SIGNAL 1 20 POTENTIAL.

FT PROPEP 21 270

FT CHAIN 271 382 TRANSFORMING GROWTH FACTOR BETA.

FT DISULFID 278 286 BY SIMILARITY.

FT DISULFID 285 348 BY SIMILARITY.

FT DISULFID 314 379 BY SIMILARITY.

FT DISULFID 318 381 BY SIMILARITY.

FT DISULFID 347 347 INTERCHAIN (BY SIMILARITY).

FT CARBOHYD 76 76 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 116 116 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 125 125 N-LINKED (GLCNAC...) (POTENTIAL).

FT CONFLICT 237 237 N -> D (IN REF. 2).

FT CONFLICT 345 345 Q -> H (IN REF. 2).

FT CONFLICT 371 372 LS -> VP (IN REF. 2).

FT CONFLICT 377 377 K -> M (IN REF. 2).

SQ SEQUENCE 382 AA; 44136 MW; 93BD4D3540084B92 CRC64;

Query Match 38.2%; Score 808.5; DB 13; Length 382;

Best Local Similarity 47.1%; Pred. No. 1.2e-63;

Matches 181; Conservative 57; Mismatches 107; Indels 39; Gaps 13;



Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPP SQGDVPPGPLPEAVLALYNSTRD 86

:||||::hllllllllllllllllMI|| I HI :| :::::|||| : 

Db 23 MSTCKSLDLELVKRKRIEAIRGQILSKLRLPKEPEIDQEGDTE--EVPASLMSIYNSTVE 80

Qy 87 RVAGESVE PEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSEL 140

-11 : I II hllll : t :1 I : I : III Ih 

Db 81 -LSEEQVHTYIPSTQDAEEEA-YFAKEVHKFNMKQSENT SKHQI--LFNMSEM 129

Qy 141 REAVPEPVLLSRAELRLL RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWL 196

I : llhllllll I II :|lh I III : : h 

Db 130 RSVLGTDRLLSQAELRLLIKNHGLLDDSEQRLELYRGV-GDKARYLKSHFVSKEWANRWV 188

Qy 197 SFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATI--HGMN 254

Mill : :|l :|hl I Ih h 

Db 189 SFDVTQTLNEWLQGAGEEQGFQLKLPCDCGKPMEEFRFKISGMNK-LRGNTETLAMKMPS 247

Qy 255 RPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDL 314

:| :|lh hll I III :| I I ::||lhlllllllll 

Db 248 KPHILLMSLPVERHSQL-SSRKKRQ TTTEEICSDKSESCCVRKLYIDFRKDL 298

Qy 315 GWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI 374

MIIIIM II M:hlll llh : :|hlll|| IMIIII IIIMI llllll 

Db 299 GWKWIHEPTGYFANYCIGPCTYIWNTENKYSQVLALYKHHNPGASAQPCCVPQVLEPLPI 358

Qy 375 VYYVGRKPKVEQLSNMIVRSCKCS 398

:MMh IIIIIMIIhlhM 

Db 359 IYYVGRQHKVEQLSNMIVKSCRCS 382

RESULT 9 
Q9PTQ2 

ID Q9PTQ2 PRELIMINARY; PRT; 376 AA. 

AC Q9PTQ2; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created)

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Transforming growth factor beta precursor. 

OS Cyprinus carpio (Common carp) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Cyprinus. 

OX NCBI_TaxID=7962; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Yin Z., Kuang J.;

RT "Molecular cloning of carp transforming growth factor beta 1."; 

RL Submitted (MAR-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED (BY SIMILARITY).

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF136947; AAF22573.1; -. 

DR HSSP; P01137; 1KLA.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 22 POTENTIAL. 

FT PROPEP 23 264 

FT CHAIN 265 376 TRANSFORMING GROWTH FACTOR BETA. 

FT DISULFID 272 280 BY SIMILARITY. 

FT DISULFID 308 373 BY SIMILARITY. 

FT DISULFID 312 375 BY SIMILARITY. 

FT DISULFID 341 341 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 76 76 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 125 125 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 167 167 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 230 232 CELL ATTACHMENT SITE (POTENTIAL).

SQ SEQUENCE 376 AA; 43329 MW; 7F7FC4DA58B69681 CRC64;

Query Match 37.0%; Score 782.5; DB 13; Length 376;

Best Local Similarity 44.5%; Pred. No. 2.5e-61;

Matches 179; Conservative 67; Mismatches 119; Indels 37; Gaps 16;



Qy 6 LRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQ 65

:h III I II : III! :hlhllMIIIIIIIIIII|lh I 

Db 1 MRVESLLIALQCLLGFV--HYSGALSTCSPLDLELIKRKRIEAIRGQILSKLRLSKEPEV 58 

Qy 66 GDVPPGP-LPEAVLALYNST RDRVAGESVEPEPEP-EADYYAKEVTR- -VLMVESGN 118 

:| :-:|IM : I U I UIIIM : : -I 

Db 59 DEEKESQNIPAELISVYNSTVELNEEQAAPPEQPKEDPVEEEYYAKEVHKFTIKLMEKKP 118 

Qy 119 QIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLK--VEQHVELYQKYSN 176 

III : II : -h Mill II :|||| | 

Db 119 DKF LWFNITDISQTLGLNRIISQVELRLLITTFPDGSEQRLELYQVIGN 167 

Qy 177 DSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEI 236 

I III :|: : :||IMM : : I I I I I : I I : I : I : | -I 

Db 168 KS-RYLESRFI--PNQRKWLSFDVTQTLKDWLQRSEAEQGFQLKiyiADNCDPQ-KTFQLKI 223 

Qy 237 NGFNSGRRGDLATIH-GMNRPFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFS 295

I III h I II :hh Ih :||: II : : h I 
Db 224 PGLVL-VRGDTETLAVNMPRPHILVMSLPLDGN NSSKSRRKRQTETDQV CTD 274 

Qy 296 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 355

WW lllllllllllllllll Ihlhl I I ::|: : Hhlllll II 
Db 275 KSD-GCCVRSLYIDFRKDLGWKWIHEPSGYYANYCTGSCSFVWTSENKYSQVLALYKHHN 333 

Qy 356 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKC 397

mil II III I MM IMIh IMIMIIIhMII 

Db 334 PGASAQPCRVPQVLNPLPIFYYVGRQHKVEQLSNMIVKTCKC 375 



RESULT 10 
Q9ERB7 

ID Q9ERB7 PRELIMINARY; PRT; 399 AA.

AC Q9ERB7;

DT 01-MAR-2001 (TrEMBLrel. 16, Created)

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update)

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)

DE Transforming growth factor-beta 2 (Fragment).

OS Mesocricetus auratus (Golden hamster).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Cricetinae;

OC Mesocricetus.

OX NCBI_TaxID=10036;

RN [1]

RP SEQUENCE FROM N.A.

RA Ramesh G., Kondaiah P., Seshagiri P.B.;

RT "Differential expression and selective localization of transforming 

RT growth factor-beta isoforms in the hamster uterus during estrous 

RT cycle."; 

RL Submitted (AUG-2000) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AY007214; AAG02247.1; 

DR HSSP; P08112; 2TGI.

DR InterPro; IPR001839; TGFb.

DR InterPro; IPR001111; TGFb_N.

DR InterPro; IPR003911; TGF_TGFb.

DR Pfam; PF00019; TGF-beta; 1.

DR Pfam; PF00688; TGFb_propeptide; 1.

DR PRINTS; PR01423; TGFBETA.

DR ProDom; PD000357; TGFb; 1.

DR SMART; SM00204; TGFB; 1.

DR PROSITE; PS00250; TGF_BETA_1; 1.

FT NON_TER 1 1

FT NON_TER 399 399

SQ SEQUENCE 399 AA; 46078 MW; A6FF8E65EAFD5148 CRC64;

Query Match 36.3%; Score 767.5; DB 11; Length 399; 

Best Local Similarity 43.1%; Pred. No. 5.9e-60; 

Matches 177; Conservative 60; Mismatches 121; Indels 53; Gaps 15; 
Qy 18 LLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVP-PGPLPEA 76 

II I I I MM MM: MIMMMMMMM Ml III M 

Db 4 LLHLVP--VALSLSTCSTLDMDQFMRKRIEAIRGQILSKLKLTSPPE--DYPEPDEVPPE 59 

Qy 77 VLALYNSTRD RVAGESVEPEPE- PEADYYAKEVTRVLM VESGNQI YDKFKGTP 128 

h-llllll : : : I I : Ml III I - I III I I 

Db 60 VISIYNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHFPSENAIPPTFY-RP 118 

Qy 129 HSLYMLFNTSELREAVPEPVLLSRAELRLLRL---KLKV-EQHVELYQ KYSNDSW 179 

^ : h I : : I Ml h II I M II Mill 

Db 119 YFRIVRFDVSMMEKNASN---LVKAEFRVFRLQNPKARVAEQRIELYQILKSKDLTSPTQ 175 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSK 228 

l|: I II MM I I Ml - I MM I : M 

Db 176 RYIDSKWKTRAEGEWLSFDVTDAVHEWLHHKDRNLGFKISLHCPCCTFVPFNNNIIPNK 235 

Qy 229 DNTLHVE---INGFNSGRRGDLATIHGMNR PFLLLMATPLERAQHLHSSRHRRD 279 

I IM : I II : I MM I I : IM - 

Db 236 SEELEARFAGIDGTSQHSSGHQETIKSTRKKNSGKTPHLLLMLLPSYRLESQQSNRRKK- 294 

Qy 280 YKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWS 339 

MM III : : IIIM llllhMIIIIIIIIIIIIMIII I IIIMI 
Db 295 RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGACPYLWS 348 

Qy 340 LDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM 390 

llhMIIMII II IIIMIII MM IMIM IIMIIIII 

Db 349 SDTQHTKVLSLYNTINPEASASPCCVSHDLEPLTILYYIGNTPKIEQLSNM 399 



RESULT 11 
Q99K17 

ID Q99K17 PRELIMINARY; PRT; 362 AA. 

AC Q99K17; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created)
DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Similar to transforming growth factor, beta 3 (Fragment).
GN TGFB3.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 



RA Strausberg R.; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases. 

CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC005513; AAH05513.1; -.
DR HSSP; P10600; 1TGJ.
DR MGD; MGI:98727; Tgfb3.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR InterPro; IPR003911; TGF_TGFb.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.
FT NON_TER 1 1
SQ SEQUENCE 362 AA; 41486 MW; 0808E46180FDAE70 CRC64;

Query Match 34.4%; Score 727; DB 11; Length 362; 

Best Local Similarity 44.0%; Pred. No. 2.1e-56; 

Matches 164; Conservative 52; Mismatches 113; Indels 44; Gaps 13; 

Qy 58 RLASPPSQGDVPPGPLPEAVLALYNSTR---DRVAGESVE--PEPEPEADYYAKEVTRVL 112 

Db 2 RVGSPPEPSVMT--HVPYQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFD 59 

Qy 113 MVE---SGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVE 165 

h: h: II :: II I : : | ||| |:||: : | 

Db 60 MIQGLAEHNELAVCPKGITSKVFR-FNVSSVEK---NGTNLFRAEFRVLRVPNPSSKRTE 115 

Qy 166 QHVELYQKYSND SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSA 221 

I :|hl I Ih : I : llllllll Ihll llh I U 

Db 116 QRIELFQILRPDEHIAKQRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISI 175 

Qy 222 H SSSDSKDN- - -TLHVEINGFNS- - -GRRGDLATIHGM- - -NRPFLLLMATPL 265 

I : I :| : :: I :: |||l : : I hll I 

Db 176 HCPCHTFQPNGDILENVHEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPP 235 

Qy 266 ERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGY 325 

I h HIIIIIII : hlllll lllllhlllllhllllll 

Db 236 HRLDSPGQGSQRK KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGY 289 

Qy 326 HANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVE 385 

:||ll llllh I II :| II III II llhllllll Mil hlllll MM 
Db 290 YANFCSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVE 349 

Qy 386 QLSNMIVRSCKCS 398

MIIMhIMII 

Db 350 QLSNMVVKSCKCS 362

RESULT 12 
Q08714 

ID Q08714 PRELIMINARY; PRT; 130 AA. 

AC Q08714; O70331;
DT 01-NOV-1996 (TrEMBLrel. 01, Created)
DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update)
DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update)
DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment).
GN TGFB1.
OS Mesocricetus auratus (Golden hamster).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Cricetinae;
OC Mesocricetus.
OX NCBI_TaxID=10036;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=LVG (SYR);
RX MEDLINE=93304479; PubMed=8317544;
RA Wong D.T., Donoff R.B., Yang J., Song B.Z., Matossian K., Nagura N.,
RA Elovic A., McBride J., Gallagher G., Todd R.;
RT "Sequential expression of transforming growth factors alpha and beta 1
RT by eosinophils during cutaneous wound healing in the hamster.";
RL Am. J. Pathol. 143:130-142(1993).
RN [2]
RP SEQUENCE OF 26-115 FROM N.A.
RC STRAIN=SYRIAN; TISSUE=SPLEEN;
RX MEDLINE=98234044; PubMed=9573100;
RA Melby P.C., Tryon V.V., Chandrasekar B., Freeman G.L.;
RT "Cloning of Syrian hamster (Mesocricetus auratus) cytokine cDNAs and
RT analysis of cytokine mRNA expression in experimental visceral
RT leishmaniasis.";
RL Infect. Immun. 66:2135-2142(1998).
CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL
CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF
CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1
CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND
CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS.
CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED.
CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY.
DR EMBL; X60296; CAA42838.1; -.
DR EMBL; AF046214; AAC40099.1; -.
DR HSSP; P01137; 1KLA.
DR InterPro; IPR001839; TGFb.
DR Pfam; PF00019; TGF-beta; 1.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein. 



FT NON_TER 1 1
FT PROPEP <1 18
FT CHAIN 19 130 TRANSFORMING GROWTH FACTOR
FT DISULFID 25 34 BY SIMILARITY.
FT DISULFID 33 96 BY SIMILARITY.
FT DISULFID 66 129 BY SIMILARITY.
FT DISULFID 95 95 INTERCHAIN (BY SIMILARITY)
FT CONFLICT 93 93 G -> S (IN REF. 2).
SQ SEQUENCE 130 AA; 14997 MW; 8B41DD6CF39CCA77 CRC64;



Query Match 32.9%; Score 696; DB 11; Length 130; 

Best Local Similarity 92.0%; Pred. No. 2.8e-54; 



Matches 127; Conservative 0; Mismatches 3; Indels 8; Gaps 1; 



Qy 261 MATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIH 320 

IIMIIMIII IIMII llllllllllllllllllllllllllllllllll 

Db 1 MATPLERAQHLQSSRHRR ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIH 52 

Qy 321 EPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGR 380

MIIIIIIMIIIIIIIIIIIIMIIIIIIIIIMIMII lllllllllllllllllll 

Db 53 EPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAGPCCVPQALEPLPIVYYVGR 112 

Qy 381 KPKVEQLSNMIVRSCKCS 398

IIIIIIIIIIIMI III 

Db 113 KPKVEQLSNMIVRSYKCS 130

RESULT 13 
Q98854 

ID Q98854 PRELIMINARY; PRT; 361 AA. 

AC Q98854; 

DT 01-FEB-1997 (TrEMBLrel. 02, Created)
DT 01-FEB-1997 (TrEMBLrel. 02, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Transforming growth factor beta 2 (TGF-beta 2) (Fragment).
GN TGFB2.
OS Cyprinus carpio (Common carp).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Cyprinus.
OX NCBI_TaxID=7962;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=HEART; 

RX MEDLINE=97354301; PubMed=9210595;
RA Sumathy K., Desai K.V., Kondaiah P.;

RT "Isolation of transforming growth factor-beta2 cDNA from a fish, 

RT Cyprinus carpio by RT-PCR."; 

RL Gene 191:103-107(1997). 

CC -!- FUNCTION: TGF-BETA 2 HAS SUPPRESSIVE EFFECTS ON INTERLEUKIN-2 
CC DEPENDENT T-CELL GROWTH. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED (BY SIMILARITY).
CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY.
DR EMBL; U66874; AAB62983.1; -.
DR HSSP; P08112; 2TGI.
DR InterPro; IPR001839; TGFb.
DR InterPro; IPR001111; TGFb_N.
DR InterPro; IPR003911; TGF_TGFb.
DR Pfam; PF00019; TGF-beta; 1.
DR Pfam; PF00688; TGFb_propeptide; 1.
DR PRINTS; PR01423; TGFBETA.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.
KW Growth factor; Mitogen; Glycoprotein.
FT NON_TER 1 1
FT PROPEP <1 257
FT CHAIN 258 361 TRANSFORMING GROWTH FACTOR BETA 2.
FT DISULFID 264 273 BY SIMILARITY.
FT DISULFID ? s BY SIMILARITY.
FT DISULFID 334 334 INTERCHAIN (BY SIMILARITY).
FT CARBOHYD 30 30 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 98 98 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 199 199 N-LINKED (GLCNAC...) (POTENTIAL)
FT NON_TER 361 361
SQ SEQUENCE 361 AA; 41931 MW; 94D930FA970A3FD3 CRC64;

Query Match 32.2%; Score 681; DB 13; Length 361;



Best Local Similarity 41.3%; Pred. No. 2.7e-52; 

Matches 155; Conservative 55; Mismatches 115; Indels 50; Gaps 14; 

Qy 52 QILSKLRLASPPSQGDVPPGP--LPEAVLALYNSTRDRVAGESVEPEPEPE ADYY 104 

III Ihh II I I : -hllllll : - I I Ul 

Db 1 QILCKLKLSCPP---EIYPEPEEVSRDIIAIYNSTRDLLQEKANERAATCERQRTGEEYY 57 

Qy 105 AKEVTRVLM---VESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL- 160 

MM - I : I I I |: : h I : : I Mllh II 

Db 58 AKEVHKIDMQPFYPAENVIPTKHY-NPYFRRLRFDVSSMEKNASN---LVKAELRIFRLQ 113 

Qy 161 --KLKV-EQHVELYQ KYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRRE 212 

I M II MM! : Ih llllllll I Ml h 

Db 114 NPKARVSEQRIELYQILGHKDLTSPTQRYIDSKWRTRTEGEWLSFDVTEAVSEWLLHRD 173 

Qy 213 AIEGFRLSAHSSS DSKDNTLHVEINGFNSG- -RRGDLATI HGMNR 255 

IMM I M I I : III I 

Db 174 RNNGFKISLHCPCCTFVPSNNYIIPNKSEELEARFAGIDDSFVHGGDLKMFKKRRHSGQS 233 

Qy 256 PFLLLMATPLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLG 315 

I MM I I : I I Ih MM Ml : : IMM IIIIIMIII 

Db 234 PHLLLMLLPSYRLESQHKS-HRQ KRALDAAFCFRNVQDNCCLRSLYIDFKKDLG 286 

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 375 

IIIMIMIIMIII I IIIMI IMM M III II IIIMMI I MM h 

Db 287 WKWIHEPKGYNANFCAGACPYLWSADTQHSNILGLYNTINPEASASPCCVSQDLEPLTIL 346

Qy 376 YYVGRKPKVEQLSNM 390

MM: Ihllllll 
Db 347 YYIGKTPKIEQLSNM 361 



RESULT 14 
Q95N80 

ID Q95N80 PRELIMINARY; PRT; 124 AA. 

AC Q95N80;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Transforming growth factor beta 1 (Fragment).
OS Canis familiaris (Dog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
OX NCBI_TaxID=9615;
RN [1]
RP SEQUENCE FROM N.A.
RA Fonfara S., Groene A., Baumgaertner W.;
RT "Sequence of canine transforming growth factor beta 1 mRNA in DH82-
RT cells.";
RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY.
DR EMBL; AF349538; AAK54072.1;

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

FT NON_TER 1 1 

FT NON_TER 124 124 

SQ SEQUENCE 124 AA; 14329 MW; 21D185218E5556DB CRC64;

Query Match 32.1%; Score 679; DB 6; Length 124;

Best Local Similarity 93.2%; Pred. No. 8.5e-53;

Matches 123; Conservative 0; Mismatches 1; Indels 8; Gaps 1; 

Qy 264 PLERAQHLHSSRHRRDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPK 323 

Illlllllllll II llllllllllllllllilllllMIIIIIIIIIIIM 
Db 1 PLERAQHLHSSRQRR ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPK 52 

Qy 324 GYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPK 383 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 53 GYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPK 112 

Qy 384 VEQLSNMIVRSC 395 

Illlllllllll 
Db 113 VEQLSNMIVRSC 124 

RESULT 15 
O02730
ID O02730 PRELIMINARY; PRT; 112 AA.
AC O02730; O97501;
DT 01-JUL-1997 (TrEMBLrel. 04, Created)
DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update)
DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update)
DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment).
GN TGFB1 OR TGF-BETA-1.
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Taylor T.K., James E.R., McGonigle S., Yoho E.R.; 

RL Submitted (APR-1997) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 2-99 FROM N.A. 

RA Inoue K., Kawabe Y., Kodama T.;

RL Submitted (NOV-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 



CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED (BY SIMILARITY).
CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY.
DR EMBL; AF000133; AAB53806.1;
DR EMBL; AB020217; BAA36950.1;
DR HSSP; P01137; 1KLA.
DR InterPro; IPR002400; GF_cysknot.
DR InterPro; IPR001839; TGFb.
DR Pfam; PF00019; TGF-beta; 1.
DR PRINTS; PR00438; GFCYSKNOT.
DR ProDom; PD000357; TGFb; 1.
DR SMART; SM00204; TGFB; 1.
DR PROSITE; PS00250; TGF_BETA_1; 1.

KW Growth factor; Mitogen; Glycoprotein. 



FT NON_TER 1 1
FT CHAIN 1 112 TRANSFORMING GROWTH FACTOR BETA 1
FT DISULFID 7 16 BY SIMILARITY.
FT DISULFID 15 78 BY SIMILARITY.
FT DISULFID 44 109 BY SIMILARITY.
FT DISULFID 48 111 BY SIMILARITY.
FT DISULFID 77 77 INTERCHAIN (BY SIMILARITY).
FT CONFLICT 2 3 LD -> FS (IN REF. 2).
FT CONFLICT 85 92 PLPIVYYV -> ATAHRVTTL (IN REF. 2)
SQ SEQUENCE 112 AA; 12795 MW; 53C5B7D46355A6F3 CRC64;



Query Match 30.2%; Score 638; DB 6; Length 112; 

Best Local Similarity 100.0%; Pred. No. 3.3e-49;

Matches 112; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 287 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 346 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 
Db 1 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 60 

Qy 347 VLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIMMIIIIIIIIMIMIIIIIIIIIIIIIIIMIMIIIIIIII 

Db 61 VLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 



Search completed: October 28, 2003, 09:12:23 
Job time : 33.909 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2003 Compugen Ltd.



OM nucleic - nucleic search, using sw model 
Run on: October 27, 2003, 10:47:27 ; Search time 1415.77 Seconds
(without alignments)
10489.161 Million cell updates/sec

Title: US-10-017-372E-10
Perfect score: 363
Sequence: 1 gactacaaggatgacgacga gttcctgcaagtgcagctga 363

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters: 5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database 



GenEmbl:*
1: gb_ba:*
2: gb_htg:*
3: gb_in:*
4: gb_om:*
5: gb_ov:*
6: gb_pat:*
7: gb_ph:*
8: gb_pl:*
9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*
15: em_ba:*
16: em_fun:*
17: em_hum:*
18: em_in:*
19: em_mu:*
20: em_om:*
21: em_or:*
22: em_ov:*
23: em_pat:*
24: em_ph:*
25: em_pl:*
26: em_ro:*
27: em_sts:*
28: em_un:*
29: em_vi:*
30: em_htg_hum:*
31: em_htg_inv:*
32: em_htg_other:*
33: em_htg_mus:*
34: em_htg_pln:*
35: em_htg_rod:*
36: em_htg_mam:*
37: em_htg_vrt:*
38: em_sy:*
39: em_htgo_hum:*
40: em_htgo_mus:*
41: em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
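
The other per-result figures in this report can be checked by hand. The sketch below is a minimal illustration, not the GenCore implementation: it assumes that "Query Match" is the hit score divided by the query's perfect score, and that "Best Local Similarity" is the number of matches divided by the number of alignment columns (matches + conservative + mismatches + indels). Both assumptions are inferred from the numbers printed in this report; the function names are hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect score (assumed definition)."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches: int, conservative: int,
                                  mismatches: int, indels: int) -> float:
    """Matches as a percentage of all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # Nucleic search: Score 339 against a perfect score of 363.
    print(query_match_percent(339, 363))
    # Protein hit with Matches 177; Conservative 60; Mismatches 121; Indels 53.
    print(best_local_similarity_percent(177, 60, 121, 53))
```

Under these assumptions the two calls reproduce the 93.4% Query Match and 43.1% Best Local Similarity values reported for those hits.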

SUMMARIES 

                  %
Result          Query
   No.  Score   Match  Length  DB  ID          Description
     1  339     93.4     1326   6  AX338213    AX338213 Sequence
     2  339     93.4     1605   4  SSTGFBR     Y00111 Porcine mRN
     3  339     93.4     1750   4  GGTGFB1     X12373 Porcine mRN
     4  339     93.4     2221   4  AF461808    AF461808 Sus scrof
     5  339     93.4     3206   4  PIGTGFBIA   M23703 Sus scrofa
     6  312.8   86.2      489   6  AX455100    AX455100 Sequence
     7  312.8   86.2     1369   4  DOGTGFBIA   L34956 Canine tran
     8  312.2   86.0     1173   4  OATGFB1     X76916 O.aries mRN
     9  311.8   85.9     1561   9  AGMTGFB     M16658 Simian tran
    10  310.2   85.5      339   6  AR036686    AR036686 Sequence
    11  310.2   85.5     1560   6  I08268      I08268 Sequence 2
    12  309.2   85.2     1560   6  I06216      I06216 Sequence 2
    13  309     85.1     1117   4  BOVTGFB     M36271 Bovine tran
    14  308.6   85.0      339   4  OCAF000133  AF000133 Oryctolag
    15  308.6   85.0      339   6  A18277      A18277 H.sapiens T
    16  308.6   85.0      339   6  A23751      A23751 TGF-beta1 c
    17  308.6   85.0      339   6  A48549      A48549 Sequence 1
    18  308.6   85.0      339   6  A48563      A48563 Sequence 1
    19  308.6   85.0      339   6  I56854      I56854 Sequence 1
    20  308.6   85.0      650   6  AX336646    AX336646 Sequence
    21  308.6   85.0      650   9  HUMTGFBA    M38449 Homo sapien
    22  308.6   85.0      699   6  I05434      I05434 Sequence 4
    23  308.6   85.0      862   6  I03312      I03312 Sequence 3
    24  308.6   85.0     1176   6  AX481432    AX481432 Sequence
    25  308.6   85.0     1176   6  AX615127    AX615127 Sequence
    26  308.6   85.0     1176   6  AX615128    AX615128 Sequence
    27  308.6   85.0     1746   9  BC022242    BC022242 Homo sapi
    28  308.6   85.0     1780   9  BC000125    BC000125 Homo sapi
    29  308.6   85.0     1780   9  BC001180    BC001180 Homo sapi
    30  308.6   85.0     1821   6  E03028      E03028 DNA encodin
    31  308.6   85.0     2537   6  A06669      A06669 Synthetic m
    32  308.6   85.0     2745   9  HSTGFB1     X02812 Human mRNA
    33  308.2   84.9      337   6  AR198575    AR198575 Sequence
    34  307     84.6     1265   6  AX455875    AX455875 Sequence
    35  307     84.6     2527   6  E00973      E00973 cDNA encodi
    36  ?       84.?        ?   9  BT007245    BT007245 Homo sapi
    37  306.6   84.5     1173  12  BT007866    BT007866 Synthetic
    38  306     84.3     1569   6  I06221      I06221 Sequence 3
    39  303.8   83.7      469  10  MATGFB1     X60296 M.auratus m
    40  303.6   83.6      334   4  SHPGFBIW    L36038 Ovis aries
    41  302.8   83.4      374   4  AF349538    AF349538 Canis fam
    42  302.2   83.3     1173   4  ECRGFB1     X99438 E.caballus
    43  300.6   82.8     1187   4  AF175709    AF175709 Equus cab
    44  300.6   82.8     1597  10  AF191297    AF191297 Cavia por
    45  287.8   79.3     1579  10  MUSTGFRNA   M13177 Mouse trans



ALIGNMENTS 



RESULT 1 
AX338213 

LOCUS AX338213 1326 bp DNA linear PAT 09-JAN-2002
DEFINITION Sequence 1 from Patent WO0181404.
ACCESSION AX338213
VERSION AX338213.1 GI:18128750
KEYWORDS
SOURCE Sus scrofa (pig)
ORGANISM Sus scrofa
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE 1
AUTHORS Strober,W., Nakamura,K., Kitani,A. and Fuss,I.J.
TITLE Inducible plasmid vector encoding tgf-_g(b) and uses thereof
JOURNAL Patent: WO 0181404-A 1 01-NOV-2001;
THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US)
FEATURES Location/Qualifiers
source 1..1326
/organism="Sus scrofa"
/mol_type="genomic DNA"
/db_xref="taxon:9823"
CDS 16..1188
/note="unnamed protein product"
/codon_start=1
/protein_id="CAD20538.1"
/db_xref="GI:18128751"

/translation="MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 

BASE COUNT 263 a 438 c 392 g 233 t 

ORIGIN 



Query Match 93.4%; Score 339; DB 6; Length 1326;
Best Local Similarity 100.0%; Pred. No. 1.4e-60;
Matches 339; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 850 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 909

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 910 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 969

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 970 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1029

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1030 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1089

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1090 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1149

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
|||||||||||||||||||||||||||||||||||||||
Db 1150 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1188



RESULT 2 

SSTGFBR 

LOCUS SSTGFBR 1605 bp mRNA linear MAM 27-MAR-1995
DEFINITION Porcine mRNA for transforming growth factor-beta (TFG) precursor.
ACCESSION Y00111
VERSION Y00111.1 GI:2129
KEYWORDS transforming growth factor-beta.
SOURCE Sus scrofa (pig)
ORGANISM Sus scrofa
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE 1 (bases 1 to 1605)
AUTHORS Derynck,R. and Rhee,L.
TITLE Sequence of the porcine transforming growth factor-beta precursor
JOURNAL Nucleic Acids Res. 15 (7), 3187 (1987)
MEDLINE 87174844
PUBMED 3470708
FEATURES Location/Qualifiers
source 1..1605
/organism="Sus scrofa"
/mol_type="mRNA"
/db_xref="taxon:9823"
/tissue_type="ovary"
/clone_lib="lambda gt10"
CDS 404..1576
/note="TFG-beta precursor (AA 1-390)"
/codon_start=1
/protein_id="CAA68291.1"
/db_xref="GI:2130"
/db_xref="SWISS-PROT:P07200"
/translation="MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
YYAKEVTRVLMLESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
RPALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
mat_peptide 1238..1573
/product="mature TFG-beta"
misc_feature 647..655
/note="pot. N-glycosylation site"
misc_feature 809..817
/note="pot. N-glycosylation site"
BASE COUNT 298 a 572 c 437 g 298 t
ORIGIN

Query Match 93.4%; Score 339; DB 4; Length 1605;
Best Local Similarity 100.0%; Pred. No. 1.4e-60;
Matches 339; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1238 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 1297

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1298 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1357

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1358 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1417

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1418 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1477

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1478 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1537

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
|||||||||||||||||||||||||||||||||||||||
Db 1538 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1576



RESULT 3 

GGTGFB1
LOCUS GGTGFB1 1750 bp mRNA linear MAM 27-MAR-1996
DEFINITION Porcine mRNA for transforming growth factor-beta 1.
ACCESSION X12373
VERSION X12373.1 GI:63808
KEYWORDS transforming growth factor-beta 1.
SOURCE Sus scrofa (pig)
ORGANISM Sus scrofa
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE 1 (bases 1 to 1750)
AUTHORS Jakowlew,S.B., Dillard,P.J., Sporn,M.B. and Roberts,A.B.
TITLE Nucleotide sequence of chicken transforming growth factor-beta 1
(TGF-beta 1)
JOURNAL Nucleic Acids Res. 16 (17), 8730 (1988)
MEDLINE 88335639
PUBMED 3166520
REFERENCE 2 (bases 1 to 1750)
AUTHORS Jakowlew,S.B.
TITLE Direct Submission
JOURNAL Submitted (14-JUL-1988) Jakowlew S.B., National Institute of
health, National Cancer Institute, Laboratory of Chemoprevention,
Building 41, Room B902, Bethesda, Maryland 20892, USA
COMMENT The submitters believe that the chicken cDNA library was
contaminated with porcine cDNA, and that the sequence is infact
porcine TGF-beta-1. 27-MAR-1996.
FEATURES Location/Qualifiers
source 1..1750
/organism="Sus scrofa"
/mol_type="mRNA"
/strain="white leghorn"
/db_xref="taxon:9823"
/clone="pTGFB-ChX 119"
/cell_type="chondrocyte"
gene 1..1750
/gene="TGF-beta 1"
5'UTR 1..446
/gene="TGF-beta 1"
CDS 447..1622
/gene="TGF-beta 1"
/codon_start=1
/product="transforming growth factor"
/protein_id="CAA30933.1"
/db_xref="GI:63809"
/db_xref="SWISS-PROT:P07200"
/translation="MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR
LSAHCSCDSKDNTLHVEINAGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSR
HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD
TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
mat_peptide 1285..1619
/gene="TGF-beta 1"
3'UTR 1620..1750
/gene="TGF-beta 1"
BASE COUNT 325 a 627 c 479 g 319 t
ORIGIN

Query Match 93.4%; Score 339; DB 4; Length 1750;
Best Local Similarity 100.0%; Pred. No. 1.3e-60;
Matches 339; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1284 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 1343

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1344 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1403

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1404 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1463

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1464 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1523

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1524 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1583

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
|||||||||||||||||||||||||||||||||||||||
Db 1584 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1622





RESULT 4 
AF461808 
LOCUS AF461808 2221 bp mRNA linear MAM 03-JAN-2002
DEFINITION Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete
cds.
ACCESSION AF461808
VERSION AF461808.1 GI:18042250
KEYWORDS
SOURCE Sus scrofa (pig)
ORGANISM Sus scrofa
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE 1 (bases 1 to 2221)
AUTHORS Wimmers,K., Chomdej,S., Ponsuksili,S. and Schellander,K.
TITLE Polymorphism in the porcine transforming growth factor beta 1 gene
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 2221)
AUTHORS Wimmers,K., Chomdej,S., Ponsuksili,S. and Schellander,K.
TITLE Direct Submission
JOURNAL Submitted (20-DEC-2001) Institute of Animal Breeding Science,
University of Bonn, Endenicher Allee 15, Bonn 53115, Germany
FEATURES Location/Qualifiers
source 1..2221
/organism="Sus scrofa"
/mol_type="mRNA"
/db_xref="taxon:9823"
/chromosome="6"
gene 1..2221
/gene="TGFB1"
CDS 1..1173
/gene="TGFB1"
/note="cytokine"
/codon_start=1
/product="transforming growth factor beta 1"
/protein_id="AAL57902.1"
/db_xref="GI:18042251"
/translation="MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
QYSKVIALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
3'UTR 1174..2221
/gene="TGFB1"
BASE COUNT 504 a 616 c 649 g 452 t
ORIGIN

Query Match 93.4%; Score 339; DB 4; Length 2221;
Best Local Similarity 100.0%; Pred. No. 1.3e-60;
Matches 339; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 835 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 894

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 895 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 954

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 955 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1014

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1015 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1074

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1075 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1134

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
|||||||||||||||||||||||||||||||||||||||
Db 1135 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1173



RESULT 5 

PIGTGFBIA
LOCUS       PIGTGFBIA    3206 bp    mRNA    linear   MAM 31-MAR-1995
DEFINITION  Sus scrofa transforming growth factor beta-1 mRNA, complete cds.
ACCESSION   M23703
VERSION     M23703.1  GI:755044
KEYWORDS    transforming growth factor-beta-1.
SOURCE      Sus scrofa (pig)
  ORGANISM  Sus scrofa
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE   1  (bases 1 to 3206)
  AUTHORS   Kondaiah,P., Van Obberghen-Schilling,E., Ludwig,R.L., Dhar,R.,
            Sporn,M.B. and Roberts,A.B.
  TITLE     cDNA cloning of porcine transforming growth factor-beta 1 mRNAs.
            Evidence for alternate splicing and polyadenylation
  JOURNAL   J. Biol. Chem. 263 (34), 18313-18317 (1988)
  MEDLINE   89054010
   PUBMED   2461367
COMMENT     On Apr 1, 1995 this sequence version replaced gi:341017.
            Original source text: Sus scrofa (strain miniature swine) cDNA to
            mRNA.
FEATURES             Location/Qualifiers
     source          1..3206
                     /organism="Sus scrofa"
                     /mol_type="mRNA"
                     /strain="miniature swine"
                     /db_xref="taxon:9823"
                     /cell_type="peripheral blood lymphocyte"
     gene            1..3206
                     /gene="TGF-beta-1"
     CDS             906..2078
                     /gene="TGF-beta-1"
                     /codon_start=1
                     /product="transforming growth factor-beta-1"
                     /protein_id="AAA64616.1"
                     /db_xref="GI:755045"
                     /translation="MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD
                     YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR
                     LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     polyA_signal    3186..3191
                     /gene="TGF-beta-1"
     polyA_site      3206
                     /gene="TGF-beta-1"
BASE COUNT      645 a   1041 c    924 g    596 t
ORIGIN



  Query Match             93.4%;  Score 339;  DB 4;  Length 3206;
  Best Local Similarity   100.0%;  Pred. No. 1.3e-60;
  Matches  339;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1740 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 1799

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1800 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1859

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1860 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1919

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1920 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1979

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1980 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 2039

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||||||||||||||||||||||
Db       2040 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 2078



RESULT 6 
AX455100 

LOCUS AX455100 489 bp DNA linear PAT 06-JUL-2002 

DEFINITION Sequence 167 from Patent WO0208453. 
ACCESSION AX455100 

VERSION AX455100.1 GI:21714285 

KEYWORDS 

SOURCE Canis familiaris (dog) 

ORGANISM Canis familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

REFERENCE 1 

AUTHORS Farr,S.B., Pickett,G.G., Neft,R.E. and Dunn,R.T.
TITLE Canine toxicity genes

JOURNAL Patent: WO 0208453-A 167 31-JAN-2002;
Phase-1 Molecular Toxicology (US)
FEATURES Location/Qualifiers

source 1..489

/organism="Canis familiaris"
/mol_type="genomic DNA"
/db_xref="taxon:9615"
BASE COUNT 84 a 183 c 145 g 76 t 1 others

ORIGIN 

Query Match 86.2%; Score 312.8; DB 6; Length 489; 

Best Local Similarity 95.0%; Pred. No. 4.3e-55; 

Matches 323; Conservative 0; Mismatches 17; Indels 0; Gaps 0; 

Qy 24 GGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCT 83 

lllllllll IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII llllllll 
Db 71 GGCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCT 13 0 

Qy 84 CTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA 143 

IIIIIMIIIIIIM Mill IIIIIMIIMIIIIII IIIM MIIIIM MM! 

Db 131 CTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCA 190 

Qy 144 TGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAA 203 

II II lllllllllllllllll llllllll llllllll Mill IIIIIMIIII 

Db 191 CGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAA 250 

Qy 204 GGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 2 63 

MMIIIII IIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIII 

Db 251 GGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 310 

Qy 264 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 323

MMMMMMIMIMMIIMMMMMMMMIIIMMMMIMMMMM 

Db 311 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 370 

Qy 324 GCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

Mill! IIIIIMIIIIMI MIMIIIIIIIIIMII 

Db 371 GCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 410 



RESULT 7 

DOGTGFBIA
LOCUS       DOGTGFBIA    1369 bp    mRNA    linear   MAM 30-OCT-1994
DEFINITION  Canine transforming growth factor-beta 1 (TGFB1) mRNA, complete
            cds.
ACCESSION   L34956
VERSION     L34956.1  GI:516071
KEYWORDS    homologue; transforming growth factor-beta 1.
SOURCE      Canis familiaris (dog)
  ORGANISM  Canis familiaris
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
REFERENCE   1  (bases 1 to 1369)
  AUTHORS   Manning,A.M., Auchampach,J.A., Drong,R.F. and Slightom,J.L.
  TITLE     Cloning of a canine cDNA homologous to human transforming growth
            factor-beta 1 (TGFbeta1)
  JOURNAL   Unpublished (1994)
COMMENT     Original source text: Canis familiaris adult jugular vein
            endothelial cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..1369
                     /organism="Canis familiaris"
                     /mol_type="mRNA"
                     /db_xref="taxon:9615"
                     /cell_type="LPS-activated"
                     /tissue_type="jugular vein endothelial"
                     /dev_stage="adult"
     gene            1..1369
                     /gene="TGFB1"
     5'UTR           1..57
                     /gene="TGFB1"
     CDS             58..1230
                     /gene="TGFB1"
                     /function="anti-inflammatory agent"
                     /note="precursor"
                     /codon_start=1
                     /product="transforming growth factor-beta 1"
                     /protein_id="AAA51458.1"
                     /db_xref="GI:516072"
                     /translation="MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRK
                     RIEAIRGQILSKLRLSSPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEAD
                     YYAKEVTRVLMVENTNKIYEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRL
                     KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFR
                     LSAHCSCDSKDNTLQVDINGFSSSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQ
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     1039..1227
                     /gene="TGFB1"
                     /product="transforming growth factor-beta 1"
     3'UTR           1231..1369
                     /gene="TGFB1"
BASE COUNT      264 a    473 c    415 g    216 t      1 others
ORIGIN

  Query Match             86.2%;  Score 312.8;  DB 4;  Length 1369;
  Best Local Similarity   95.0%;  Pred. No. 3.9e-55;
  Matches  323;  Conservative    0;  Mismatches   17;  Indels    0;  Gaps    0;



Qy 24 GGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCT 83 

lllllllll lllllllllllllllllllllllllllllllllllllllll llllllll 
Db 891 GGCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCT 950 

Qy 84 CTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA 143 

lllllllllllllll Mill IIIIIIIIIIIMIIII Mill IIMIIII Mill 

Db 951 CTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCA 1010 

Qy 144 TGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAA 203 

M M IIIIMIIIIMIMII MMIIII IIIIIMI Mill lllllllllll 

Db 1011 CGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAA 1070 

Qy 2 04 GGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 263 

IIMIIIII IIMIIMMMIIIMIIIIMIIIIIIMMIIIIIIIIIIIIIMII 

Db 1071 GGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 1130 

Qy 264 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 323 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMMIMIIIIIIIIIIIIII 

Db 1131 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 1190 

Qy 324 GCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

IIMII IMIIIIMIIIM IIIIIIIIIIIMMIM 

Db 1191 GCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 123 0 



RESULT 8 

OATGFB1
LOCUS       OATGFB1      1173 bp    mRNA    linear   MAM 18-APR-1995
DEFINITION  O.aries mRNA for transforming growth factor-beta 1.
ACCESSION   X76916
VERSION     X76916.1  GI:496648
KEYWORDS    TGF-beta 1; transforming growth factor-beta 1.
SOURCE      Ovis aries (sheep)
  ORGANISM  Ovis aries
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Caprinae; Ovis.
REFERENCE   1
  AUTHORS   Woodall,C.J., McLaren,L.J. and Watt,N.J.
  TITLE     Sequence and chromosomal localisation of the gene encoding ovine
            latent transforming growth factor-beta 1
  JOURNAL   Gene 150 (2), 371-373 (1994)
  MEDLINE   95121932
   PUBMED   7821809
REFERENCE   2  (bases 1 to 1173)
  AUTHORS   Woodall,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-DEC-1993) C. Woodall, Univ. of Edinburgh, Dept. of
            Veterinary Pathology, Sc. of Vet. Studies, Univ. of Edinburgh,
            Edinburgh EH9 1QH, UK
FEATURES             Location/Qualifiers
     source          1..1173
                     /organism="Ovis aries"
                     /mol_type="mRNA"
                     /db_xref="taxon:9940"
     CDS             1..1173
                     /codon_start=1
                     /product="transforming growth factor-beta 1"
                     /protein_id="CAA54242.1"
                     /db_xref="GI:496649"
                     /db_xref="SWISS-PROT:P50414"
                     /translation="MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTIDMELVKRK
                     GIEAIRGQILSKLRLASPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEAD
                     YYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFFNTSELREAVPEPVLLSRADVRLLRL
                     KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFR
                     LSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     835..1170
                     /product="transforming growth factor-beta 1"
BASE COUNT      245 a    378 c    336 g    214 t
ORIGIN

  Query Match             86.0%;  Score 312.2;  DB 4;  Length 1173;
  Best Local Similarity   93.4%;  Pred. No. 5.3e-55;
  Matches  326;  Conservative    0;  Mismatches   23;  Indels    0;  Gaps    0;



Qy 

Db 



15 CGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGT 74 

I II I Mllllll IIIIIIIIIIIMIIIIIIIIII II lllllllllll II 

82 5 CCACCGCCGAGCCCTGGACACCAACTACTGCTTCAGCTCCACAGAAAAGAACTGCTGTGT 884 



Qy 

Db 



75 GCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAA 134 

II llllllllllllllllllllllllilMIIIIIIIIIIIIMIIIII llllllll 

885 TCGTCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCACGAACCCAA 944 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



135 GGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCA 194 

lllllllll lllllllllllllllllllllllllllllllllllllllll Mill II 

94 5 GGGCTACCACGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTGGACACACA 1004 

195 GTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTG 254 

llllllllllllllllll IIIIIIIIIIIIIIIMIIIIIIIII IMIIIIIIIIIII 
1005 GTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCATCGGCGGCGCCGTG 1064 



255 



314 



CTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAA 

lllllllll lllllllllll II IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

1065 CTGCGTGCCTCAGGCGCTGGAACCCCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAA 1124 



315 



363 



GGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 

llllllllll IIIIIIIIMIIIIIIIII llllllllllllllllll 
1125 GGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1173 



RESULT 9 

AGMTGFB
LOCUS       AGMTGFB      1561 bp    mRNA    linear   PRI 27-APR-1993
DEFINITION  Simian transforming growth factor-beta (TGF) mRNA, complete cds.
ACCESSION   M16658
VERSION     M16658.1  GI:176552
KEYWORDS    growth factor; transforming growth factor-beta.
SOURCE      Cercopithecus aethiops (African green monkey)
  ORGANISM  Cercopithecus aethiops
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
            Cercopithecinae; Cercopithecus.
REFERENCE   1  (bases 1 to 1561)
  AUTHORS   Sharples,K., Plowman,G.D., Rose,T.M., Twardzik,D.R. and
            Purchio,A.F.
  TITLE     Cloning and sequence analysis of simian transforming growth
            factor-beta cDNA
  JOURNAL   DNA 6 (3), 239-244 (1987)
  MEDLINE   87246074
   PUBMED   3474130
COMMENT     Original source text: African green monkey cells (cell line
            BSC-40), cDNA to mRNA, clone pTGF-beta-2.
FEATURES             Location/Qualifiers
     source          1..1561
                     /organism="Cercopithecus aethiops"
                     /mol_type="mRNA"
                     /db_xref="taxon:9534"
     CDS             262..1434
                     /note="transforming growth factor-beta precursor"
                     /codon_start=1
                     /protein_id="AAA35369.1"
                     /db_xref="GI:176553"
                     /translation="MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRK
                     RIETIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD
                     YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL
                     KLKVEQHVELYQKYSNNSWRYLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFR
                     LSAHCSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH
                     RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT
                     QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     sig_peptide     262..1095
                     /note="transforming growth factor-beta signal peptide"
     mat_peptide     1096..1431
                     /product="transforming growth factor-beta"
BASE COUNT      301 a    547 c    446 g    267 t
ORIGIN



Query Match 85.9%; Score 311.8; DB 9; Length 1561; 

Best Local Similarity 95.0%; Pred. No. 6.2e-55; 

Matches 322; Conservative 0; Mismatches 17; Indels 0; Gaps 0; 

Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

llllllll llllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1096 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCIAGCTG 1155 



Qy 

Db 

Qy 
Db 

Qy 
Db 



85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

II lllllllllll Mllllll llllllllllllll II II lllllllllllllll 

1156 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1215 



145 



204 



GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 

Mill lllllllllillllllllllllllM llllllll Mill llllllllllll 

1216 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1275 
2 05 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264 

IMIIMI IIIIMIIIIIIII lllllllllll IIIIIIIIIIIIIIIMIIIIIIi 

1276 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1335 



Qy 



265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

IIIIIIIIIIIIIIIMMIIIMIIIIIIIIIIIIIIIMIIIIMIIIIIIIIMIII 



Db 1336 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1395 



Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

IMMMIMIIIMIMM IIIMIM MIIIMM 

Db 1396 CTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1434 



RESULT 10 

AR036686
LOCUS       AR036686      339 bp    DNA     linear   PAT 29-SEP-1999
DEFINITION  Sequence 29 from patent US 5800811.
ACCESSION   AR036686
VERSION     AR036686.1  GI:5954542
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Hall,F.L., Nimni,M.E., Tuan,T.-L., Wu,L. and Cheung,D.T.
  TITLE     Artificial skin prepared from collagen matrix containing
            transforming growth factor-.beta. having a collagen binding site
  JOURNAL   Patent: US 5800811-A 29 01-SEP-1998;
FEATURES             Location/Qualifiers
     source          1..339
                     /organism="unknown"
BASE COUNT       66 a    113 c    100 g     60 t
ORIGIN

  Query Match             85.5%;  Score 310.2;  DB 6;  Length 339;
  Best Local Similarity   94.7%;  Pred. No. 1.5e-54;
  Matches  321;  Conservative    0;  Mismatches   18;  Indels    0;  Gaps    0;

Qy

Db



25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

IIIMIM MIMIII IIIIMIMIIMIMMMIMIIIIIIIIIIIIIIIIII 

1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



8 5 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

MIIIIIIMIIII MIMIII IIIMIMIMIII Mill MIIIIIIIIMIII 

61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCATGAGCCCAAGGGCTACCAT 12 0 
145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 2 04 

MM! MIMIII IMIMII IMIIIII IIMMII MM! IMIIIIIIIII 

121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 18 0 

2 05 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264 

MIMIII MIIIIIIMIIM IMIIIIIIII IIMMIIMMIIIIIMIIIII 

181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 24 0 
265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

Mllllllllllll IIIIIIIMIMMIIIIIIIIIIIIMMIMIIIIIMIIIII 

241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 3 00 
325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

MIIIIIIIIMIIMIIII IIIIIIIIIIIIIMIM 

3 01 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 33 9 



RESULT 11 

108268
LOCUS       108268       1560 bp    DNA     linear   PAT 02-DEC-1994
DEFINITION  Sequence 2 from Patent EP 0373994.
ACCESSION   108268
VERSION     108268.1  GI:589017
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 1560)
  AUTHORS   Purchio,A.F., Gentry,L., Twardzik,D. and Brunner,A.M.
  TITLE     Cloning and expression of simian transforming growth factor-beta 1
  JOURNAL   Patent: EP 0373994-A1 2 20-JUN-1990;
FEATURES             Location/Qualifiers
     source          1..1560
                     /organism="unknown"
BASE COUNT      301 a    542 c    444 g    267 t      6 others
ORIGIN

  Query Match             85.5%;  Score 310.2;  DB 6;  Length 1560;
  Best Local Similarity   94.7%;  Pred. No. 1.3e-54;
  Matches  321;  Conservative    0;  Mismatches   18;  Indels    0;  Gaps    0;



Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

llllllll IIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIII 
Db 1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1154 



Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



8 5 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

II lllllllllll llllllll llllllllllllll II II IIIIIIIIMIIIII 
1155 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1214 



145 



204 



GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 

Mill IIIMIIIIIIIIIIIIIMIIIIII MIMIII Mill llllllllllll 

1215 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1274 



205 



264 



GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 

llllllll IIIIIIIIIMIII IIMIIIIIII llllllllllllllllllllllll 
1275 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1334 



265 



324 



CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 

Mill III IIIIMIIII lllllllllll MM Mill IIIIMIIIIMII III Mill 

1335 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 13 94 



325 



363 



CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 

MIMIIIIIIIIIIIIIII Mill II lllllllll 

13 95 CTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433 



RESULT 12 

106216
LOCUS       106216       1560 bp    DNA     linear   PAT 02-DEC-1994
DEFINITION  Sequence 2 from Patent EP 0293785.
ACCESSION   106216
VERSION     106216.1  GI:590649
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 1560)
  AUTHORS   Purchio,A.F., Gentry,L. and Twardzik,D.
  TITLE     Cloning and expression of simian transforming growth factor-SSI
  JOURNAL   Patent: EP 0293785-A2 2 07-DEC-1988;
FEATURES             Location/Qualifiers
     source          1..1560
                     /organism="unknown"
BASE COUNT      301 a    547 c    442 g    267 t      3 others
ORIGIN

  Query Match             85.2%;  Score 309.2;  DB 6;  Length 1560;
  Best Local Similarity   94.4%;  Pred. No. 2.2e-54;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;



Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

IIIIIIM IMIIIIIIMIIIIIMIMIIIIIIIMIIIIIM IIIIMIIIIII 

Db 10 95 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTNCGTGCGGCAGCTG 1154 

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

II lllllllllll IIIIIIM MIIIIIIIMIII II II IIIIIIIIIIIMII 

Db 1155 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1214 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



14 5 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204 

Mill IIIIIIIMIIMIIIIIIIIIIIII IMIIIII IMM MIIIIIIIIM 

1215 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1274 



205 



264 



GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 

llllllll IIIIIIIIIMIM lllllllllll llllllllllllllllllllllll 
1275 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1334 



265 



324 



CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 

II lllllllllll MM Mllll III MM III IIIMIIMMIMIMII III II III 

1335 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 13 94 



325 



363 



CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 

IMIIIIIIIIIIIIIIIII Mill II lllllllll 
1395 CTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433 



RESULT 13 

BOVTGFB
LOCUS       BOVTGFB      1117 bp    mRNA    linear   MAM 27-APR-1993
DEFINITION  Bovine transforming growth factor-beta-1 (TGF beta-1) mRNA, 3' end.
ACCESSION   M36271
VERSION     M36271.1  GI:163747
KEYWORDS    transforming growth factor-beta 1.
SOURCE      Bos taurus (cow)
  ORGANISM  Bos taurus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Bovinae; Bos.
REFERENCE   1  (bases 1 to 1117)
  AUTHORS   Van Obberghen-Schilling,E., Kondaiah,P., Ludwig,R.L., Sporn,M.B.
            and Baker,C.C.
  TITLE     Complementary deoxyribonucleic acid cloning of bovine transforming
            growth factor-beta 1
  JOURNAL   Mol. Endocrinol. 1 (10), 693-698 (1987)
  MEDLINE   91042552
   PUBMED   3153459
COMMENT     Original source text: Bovine fibropapilloma, cDNA to mRNA, (library
            of Okayama and Berg).
FEATURES             Location/Qualifiers
     source          1..1117
                     /organism="Bos taurus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9913"
     CDS             <1..950
                     /note="transforming growth factor-beta-1 precursor"
                     /codon_start=3
                     /protein_id="AAA30778.1"
                     /db_xref="GI:163748"
                     /translation="AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNK
                     IYDKMKSSSHSIYMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNN
                     SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREEIEGFRLSAHCSCDSKDNTLQVD
                     INGFSSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTEKNC
                     CVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS
                     AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS"
     mat_peptide     612..947
                     /product="transforming growth factor-beta-1"
BASE COUNT      244 a    379 c    301 g    193 t
ORIGIN

  Query Match             85.1%;  Score 309;  DB 4;  Length 1117;
  Best Local Similarity   92.8%;  Pred. No. 2.5e-54;
  Matches  324;  Conservative    0;  Mismatches   25;  Indels    0;  Gaps    0;



Qy 

Db 



15 CGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGT 74 

I II I llllllll lllllllllllllllllllllll II lllllllllll II 
602 CCACCGCCGAGCCCTGGACACCAACTACTGCTTCAGCTCCACAGAAAAGAACTGCTGTGT 661 



Qy 

Db 



75 GCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAA 134 

II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMilllllllllMIIIIIIIII 

662 TCGTCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAA 721 



Qy 

Db 



135 GGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCA 194 

III Mill lllilllllllllllllllllll II llllllllllllll II II II 
722 GGGGTACCACGCCAATTTCTGCCTGGGGCCCTGCCCTTACATCTGGAGCCTGGATACACA 781 



Qy 

Db 

Qy 
Db 

Qy 

Db 



195 GTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTG 254 

llllllllllllllllll llllllllllllllllllllllllll llllllllllllll 
782 GTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCTTCGGCGGCGCCGTG 841 



255 



842 



315 



902 



CTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAA 314 

lllllllll IIIIIIMIIIIII IIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIII 
CTGCGTGCCTCAGGCGCTGGAGCCCCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAA 901 



GGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 

lllllllll! Illllllllllllllllll llllllllllllllllll 
GGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 



363 



950 



RESULT 14 



OCAF000133
LOCUS       OCAF000133    339 bp    mRNA    linear   MAM 06-MAY-1997
DEFINITION  Oryctolagus cuniculus transforming growth factor beta-1 mRNA,
            partial cds.
ACCESSION   AF000133
VERSION     AF000133.1  GI:2072531
KEYWORDS
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Taylor,T.K., James,E.R., McGonigle,S. and Yoho,E.R.
  TITLE     Rabbit transforming growth factor beta-1 active region
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 339)
  AUTHORS   Taylor,T.K., James,E.R., McGonigle,S. and Yoho,E.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-APR-1997) Ophthalmology, Med. Univ. S.C., 171 Ashley
            Avenue, Charleston, SC 29464, USA
FEATURES             Location/Qualifiers
     source          1..339
                     /organism="Oryctolagus cuniculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9986"
     CDS             <1..339
                     /note="encodes active region"
                     /codon_start=1
                     /product="transforming growth factor beta-1"
                     /protein_id="AAB53806.1"
                     /db_xref="GI:2072532"
                     /translation="ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFC
                     LGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLS
                     NMIVRSCKCS"
BASE COUNT       71 a    114 c     97 g     57 t
ORIGIN

  Query Match             85.0%;  Score 308.6;  DB 4;  Length 339;
  Best Local Similarity   94.4%;  Pred. No. 3.3e-54;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;



Qy 

Db 



25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

llllllll lllllllllllllllllllllll llllllllllllll lllllllllll 
1 GCCCTGGACACCAACTACTGCTTCAGCTCCACAGAGAAGAACTGCTGTGTGCGGCAGCTG 60 



Qy 

Db 

Qy 
Db 

Qy 
Db 



61 



145 



TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 

llllllllllllll IIIIIIMIIIIIIIIIIIIIII II II IMIIIIIIIIIII 
TACATTGACTTCCGCAAGGACCTGGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAC 



GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 

lllll lllllllllll Mill IIIIIMIIIIIIIIII Mill llllllllllll 

121 GCCAACTTCTGCCTGGGACCCTGCCCCTACATCTGGAGCCTGGACACCCAGTACAGCAAG 



205 



181 



GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 

IIIIMII IIIIIIIIIIIIIIIIMIMIIIIIIIIIII MIIIIIMII lllll 

GTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCAGCGCCGTGCTGTGTGCCA 



144 



120 



204 



180 



264 



240 



Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 241 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 3 00 

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

IIIIIIIIIMIIIIIIIII IIIMIIMIIIIIIIII 

Db 301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339 



RESULT 15 
A18277
LOCUS       A18277        339 bp    mRNA    linear   PAT 17-MAY-1994
DEFINITION  H.sapiens TGF-beta 1 gene seq ID No:1.
ACCESSION   A18277
VERSION     A18277.1  GI:513237
KEYWORDS
SOURCE      synthetic construct
  ORGANISM  synthetic construct
            artificial sequences.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Cerletti,N., McMaster,G.K., Cox,D., Schmitz,A. and Meyhack,B.
  TITLE     Process for the production of biologically active protein (e.g.
            TGF)
  JOURNAL   Patent: EP 0433225-A 1 19-JUN-1991;
            CIBA-GEIGY AG
FEATURES             Location/Qualifiers
     source          1..339
                     /organism="synthetic construct"
                     /mol_type="mRNA"
                     /db_xref="taxon:32630"
     CDS             1..339
                     /note="Protein sequence is in conflict with the conceptual
                     translation"
                     /codon_start=1
                     /transl_table=11
                     /product="TGF-beta 1"
                     /protein_id="CAA01385.1"
                     /db_xref="GI:4529903"
                     /translation="ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFC
                     LGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLS
                     NMIVRSCKCS"
BASE COUNT       66 a    114 c    100 g     59 t
ORIGIN

  Query Match             85.0%;  Score 308.6;  DB 6;  Length 339;
  Best Local Similarity   94.4%;  Pred. No. 3.3e-54;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;



Qy

Db



25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84

llllllll llllllll IIIIIIIIIIIIMIIIMIIIIIIIIIIIMIIIIIIIII 

1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60 



Qy 

Db 



85 



61 



TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 

llllllllllllll llllllll llllllllllllll II II lllllllllllllll 

TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 



144 



120 



Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204 



IMII llllllll llllllll llllllll llllllll Mill llllllllllll 

Db 121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 18 0 

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264 

llllllll IIMIIMIIIMI IMMIIIIII llllllllllllllllllllllll 

Db 181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 24 0 

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

llllllllllll!! Illllllllllllllllllllllllllllllllllllllllllll 
Db 241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 3 00 

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

llllllllllllllllllll llllllllllllllllll 
Db 301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339 



Search completed: October 27, 2003, 18:34:44 
Job time : 1417.77 secs

GenCore version 5.1.6
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         October 27, 2003, 10:47:27 ; Search time 109.057 Seconds
                                             (without alignments)
                                             8985.201 Million cell updates/sec

Title:          US-10-017-372E-10
Perfect score:  363
Sequence:       1 gactacaaggatgacgacga gttcctgcaagtgcagctga 363

Scoring table:  IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 



Searched:       2552756 seqs, 1349719017 residues

Total number of hits satisfying chosen parameters:      5105512

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      N_Geneseq_19Jun03:*
                1:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
                2:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
                3:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
                4:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
                5:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
                6:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
                7:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
                8:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
                9:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
                10:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
                11:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
                12:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
                13:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
                14:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
                15:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
                16:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
                17:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
                18:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
                19:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
                20:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
                21:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
                22:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
                23:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
                24:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
                25:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID        Description

     1     339   93.4    1326  24  AAD22696  Porcine transformi
     2   312.8   86.2     489  24  ABL99528  Target canine gene
     3   311.8   85.9    1303  11  AAQ09317  Monkey transformin
     4   311.8   85.9    1559  13  AAQ20289  Sequence encoding
     5   311.8   85.9    1561  11  AAQ03268  Simian transformin
     6   311.8   85.9    1571  11  AAQ03269  Human transforming
     7   310.2   85.5     339  18  AAT42771  TGF-beta1 active f
     8   310.2   85.5     339  20  AAV99375  cDNA encoding a tr
     9   310.2   85.5    1560   9  AAN81084  Coding sequence of
    10   310.2   85.5    1560  11  AAQ03508  Simian Transformin
    11   308.6   85.0     339  17  AAT17235  Human TGF-beta 1 c
    12   308.6   85.0     339  20  AAX15245  cDNA encoding the
    13   308.6   85.0     650  24  ABK84023  Human cDNA differe
    14   308.6   85.0     650  24  ABL68818  Kidney cancer rela
    15   308.6   85.0    1176  24  ABZ35738  Human TGF beta 1 p
    16   308.6   85.0    1176  24  ABX09981  Human TGFbeta1 DNA
    17   308.6   85.0    1176  24  ABV78162  Human TGF beta 1 D
    18   308.6   85.0    1176  24  ABL91703  Human polynucleoti
    19   308.6   85.0    1176  25  ABV75391  TGFB1 Arg25Pro pol
    20   308.6   85.0    1176  25  ABV75392  TGFB1 Arg25Pro pol
    21   308.6   85.0    1569   9  AAN81085  Coding sequence of
    22   308.6   85.0    1821  12  AAQ13392  Human pro-TGF-beta
    23   308.6   85.0    2537                Sequence encoding
    24   308.6   85.0    2537  11  AAQ03301  cDNA encoding huma
    25   308.6   85.0    2537  11  AAQ02814  Sequence of pre-TG
    26   308.6   85.0    2537  15  AAQ56923  Human pre-TGF-beta
    27   308.6   85.0    2537  17  AAT15720  Pre-transforming g
    28   308.6   85.0    2537  19  AAV52933  Human pre-transfor
    29   308.6   85.0    2742  22  AAI58342  Human polynucleoti
    30   308.6   85.0    2745  16  AAT05876  cDNA encoding tran
    31   308.6   85.0    2745  22  AAH28216  Nucleotide sequenc
    32   308.6   85.0    3541  17  AAT16516  Collagen A1/TGF-be
    33   308.6   85.0    3541  21  AAA12498  cDNA encoding a ch
    34   308.6   85.0    4105  15  AAQ55624  TGFbeta1 5'-UTR-CD
    35     307   84.6     339  12  AAQ11993  Transforming Growt
    36     307   84.6     339  17  AAT15462  Human transforming
    37     307   84.6    1266  24  ABK12871  Chimeric TGF-B-SA
    38     307   84.6    1569  11  AAQ03509  Human Transforming
    39     307   84.6    2527  25  ABQ76674  Androgen receptor
    40   304.4   83.9     345  14  AAQ41599  Mature human Trans
    41   284.8   78.5     336  14  AAQ41606  Transforming Growt
    42   284.6   78.4    1561  11  AAQ04908  Sequence encoding
    43   284.6   78.4    1561  13  AAQ29177  TGF-beta 1/beta 2
    44   269.6   74.3     875  23  AAS70979  DNA encoding novel
    45   259.2   71.4     336  14  AAQ41604  Transforming Growt



ALIGNMENTS 



RESULT 1 
AAD22696 

ID AAD22696 standard; cDNA; 1326 BP. 
XX 

AC AAD22696; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-beta1) cDNA.
XX
KW Porcine; transforming growth factor beta 1; TGF-beta1; gene therapy;
KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive;
KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus;
KW diabetes mellitus; sarcoidosis; psoriasis; dermatological; ss.
XX
OS Sus scrofa.
XX
FH Key Location/Qualifiers
FT CDS 16..1188
FT /*tag= a
FT /product= "Porcine TGF-beta1 mutant protein"
XX
PN WO200181404-A2.
XX
PD 01-NOV-2001.
XX
PF 20-APR-2001; 2001WO-US12980.
XX 

PR 20-APR-2000; 2000US-199014P. 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 
DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 



PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 

XX 

PS Claim 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a
CC gene encoding a regulatory transcription factor under the control of a
CC promoter encoding a transforming growth factor-beta (TGF-beta). The
CC vector is useful for expressing TGF-beta, such as TGF-beta1, TGF-beta2
CC or TGF-beta3, its variants or homologues, by transfecting a cell which
CC is part of a host suspected of having an autoimmune disease, especially
CC inflammatory bowel disease (IBD), under conditions such that the
CC polypeptide encoded by the nucleic acid sequence in the vector is
CC expressed. The vector is delivered using a delivery system. The delivery
CC of the vector results in substantial elimination of symptoms of the
CC autoimmune disease and increased production of IL-10 by the host. The
CC composition is useful for treating various diseases with an autoimmune
CC component such as multiple sclerosis, rheumatoid arthritis, systemic
CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis
CC and psoriasis, and also for assaying the expression of a gene in a cell.
CC The vector is further useful for screening of the effect of test
CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells.
CC The present sequence is a cDNA encoding porcine TGF-beta1 mutant.

XX 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 Other; 

Query Match 93.4%; Score 339; DB 24; Length 1326;
Best Local Similarity 100.0%; Pred. No. 8.6e-71;
Matches 339; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        850 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 909

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        910 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 969

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        970 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1029

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1030 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1089

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1090 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1149

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||||||||||||||||||||||
Db       1150 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1188



RESULT 2 
ABL99528 



ID ABL99528 standard; cDNA; 489 BP.
XX
AC ABL99528;
XX
DT 02-JUL-2002 (first entry)
XX
DE Target canine gene TGFB1.
XX
KW Canine gene array; toxicological response; ss.
XX
OS Canis sp.
XX
PN WO200208453-A2.
XX
PD 31-JAN-2002.
XX
PF 23-JUL-2001; 2001WO-US23311.
XX
PR 21-JUL-2000; 2000US-220057P.
XX
PA (PHAS-) PHASE-1 MOLECULAR TOXICOLOGY.
XX 

PI Farr SB, Pickett GO, Neft RE, Dunn RT; 
XX 

DR WPI; 2002-217063/27. 
XX 

PT Identifying toxicologically relevant canine gene to determine 

PT toxicological responses of agents, by obtaining and comparing gene 

PT expression profiles of untreated canine cells and canine cells treated 

PT with an agent 

XX 

PS Disclosure; Page 67; 140pp; English.
XX 

CC This invention relates to identifying a toxicologically relevant canine 

CC gene and the generation of an array of toxicologically relevant 

CC canine genes. The gene array is useful for obtaining a gene expression 

CC profile, by exposing a population of cells to an agent, obtaining cDNA 

CC from the population of cells, labeling the cDNA, and contacting the cDNA 

CC with the gene array. The relevant gene is useful for making and using 

CC arrays to determine toxicological responses to various agents, and also 

CC useful for identifying novel gene sequences and novel canine genes. 

CC The method for analysing toxicological responses using the canine 

CC gene array is rapid and efficient. The present sequence is related 

CC to the canine gene array.

XX 

SQ Sequence 489 BP; 84 A; 183 C; 145 G; 76 T; 1 Other; 

Query Match 86.2%; Score 312.8; DB 24; Length 489;
Best Local Similarity 95.0%; Pred. No. 1.1e-64;
Matches 323; Conservative 0; Mismatches 17; Indels 0; Gaps 0;

Qy         24 GGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCT 83
              ||||||||| ||||||||||||||||||||||||||||||||||||||||| ||||||||
Db         71 GGCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCT 130

Qy         84 CTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA 143
              ||||||||||||||| ||||| ||||||||||||||||| ||||| |||||||| |||||
Db        131 CTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCA 190

Qy        144 TGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAA 203
               || || ||||||||||||||||| |||||||| |||||||| ||||| |||||||||||
Db        191 CGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAA 250

Qy        204 GGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 263
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        251 GGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 310

Qy        264 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 323
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        311 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 370

Qy        324 GCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||| |||||||||||||| ||||||||||||||||||
Db        371 GCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 410
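
The header figures printed for RESULT 1 and RESULT 2 can be cross-checked with simple arithmetic. The sketch below rests on three assumptions that the report itself does not state: Best Local Similarity is matches divided by aligned positions, each match scores 1 and each mismatch costs 0.6 in an ungapped alignment, and Query Match is the score divided by a query length of 363 (the last Qy coordinate shown). These assumptions reproduce the printed values, but they are inferred from the numbers and are not taken from the search program's documentation. All function names are hypothetical.

```python
# Hypothetical helpers; the scoring constants are inferred from the report,
# not documented by it.

def best_local_similarity(matches, mismatches, indels=0):
    """Percent identity over the aligned region."""
    aligned = matches + mismatches + indels
    return 100.0 * matches / aligned

def ungapped_score(matches, mismatches, mismatch_penalty=0.6):
    """Score of a gap-free alignment: +1 per match, -penalty per mismatch (assumed)."""
    return matches - mismatch_penalty * mismatches

def query_match(score, query_length=363):
    """Score as a percentage of the best possible score for the query (assumed length)."""
    return 100.0 * score / query_length

# (matches, mismatches) as printed for RESULT 1 and RESULT 2
for matches, mismatches in [(339, 0), (323, 17)]:
    score = ungapped_score(matches, mismatches)
    print(round(score, 1),
          round(query_match(score), 1),
          round(best_local_similarity(matches, mismatches), 1))
```

If the assumptions hold, the same three functions should also reproduce the headers of the later results, which gives a quick way to spot a transcription error in any one of them.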



RESULT 3 
AAQ09317 

ID AAQ09317 standard; cDNA; 1303 BP. 
XX 

AC AAQ09317; 
XX 

DT 25-MAR-2003 (updated) 

DT 12-AUG-1990 (first entry) 

XX 

DE Monkey transforming growth factor-beta cDNA. 
XX 

KW Transforming growth factor-beta; simian; psoriasis; 

KW TGF-beta. 

XX 

OS Monkey.
XX
FH Key Location/Qualifiers
FT sig_peptide 22..63
FT /*tag= a
FT mat_peptide 836..1170
FT /*tag= b
FT /product= monkey transforming growth factor-beta
XX
PN EP353772-A.
XX
PD 07-FEB-1990.
XX
PF 04-AUG-1989; 89EP-0114458.
XX
PR 05-AUG-1988; 88US-0229133.
XX
PA (ONCO ) ONCOGEN LP.
XX
PI Twardzik DR, Purchio AF, Ranchalis JE, Stevens V;
XX
DR WPI; 1990-038499/06.
DR P-PSDB; AAR03743.
XX 



PT Inhibition of proliferation of epidermal cells - 

PT used to treat psoriasis by contacting cells with compositions 

PT containing transforming growth factor-beta. 

XX 

PS Disclosure; fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis-categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03268 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1303 BP; 263 A; 442 C; 378 G; 217 T; 3 Others;

Query Match 85.9%; Score 311.8; DB 11; Length 1303;
Best Local Similarity 95.0%; Pred. No. 2.3e-64;
Matches 322; Conservative 0; Mismatches 17; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        838 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 897

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              || ||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db        898 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 957

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||||||||||||||||||||| |||||||| ||||| ||||||||||||
Db        958 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1017

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db       1018 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1077

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1078 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1137

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| |||||||| |||||||||
Db       1138 CTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1176
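
Each result's statistics are printed as semicolon-separated `name value` pairs, so they are easy to pull into a table. The following is a minimal sketch, assuming that layout holds for every header line; `parse_stats` is a hypothetical helper name and is not part of any search tool.

```python
import re

def parse_stats(line):
    """Split a header line such as 'Query Match 85.9%; Score 311.8;' into a dict.

    Keys are the field names as printed; values are kept as strings so that
    percentages and exponents (e.g. '2.3e-64') survive unchanged.
    """
    fields = {}
    for part in line.split(";"):
        part = part.strip()
        if not part:
            continue
        # Field name (lazy), whitespace, then a value starting with a digit.
        m = re.match(r"(.+?)\s+([0-9][0-9.e\-%]*)$", part)
        if m:
            fields[m.group(1)] = m.group(2)
    return fields

stats = {}
for header in (
    "Query Match 85.9%; Score 311.8; DB 11; Length 1303;",
    "Best Local Similarity 95.0%; Pred. No. 2.3e-64;",
    "Matches 322; Conservative 0; Mismatches 17; Indels 0; Gaps 0;",
):
    stats.update(parse_stats(header))
print(stats)
```

Values are left as strings on purpose; converting them to numbers is a separate step once any OCR damage (stray spaces, a comma for a decimal point) has been repaired.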



RESULT 4 
AAQ20289
ID AAQ20289 standard; cDNA; 1559 BP.
XX
AC AAQ20289;
XX
DT 25-MAR-2003 (updated)
DT 16-APR-1992 (first entry)
XX
DE Sequence encoding simian transforming growth factor (TGF) beta-1.
XX
KW Hypertension therapy; hypotensive agent; blood pressure modulator;
KW ss.
XX
OS Monkey.
XX
FH Key Location/Qualifiers
FT CDS 262..282
FT /*tag= a
FT sig_peptide 283..324
FT /*tag= b
FT CDS 325..1098
FT /*tag= c
FT mat_peptide 1099..1436
FT /*tag= d
XX
PN WO9119513-A.
XX
PD 26-DEC-1991.
XX
PF 20-JUN-1991; 91WO-US04449.
XX
PR 20-JUN-1990; 90US-0541221.
XX
PA (BRIM ) BRISTOL-MYERS SQUIBB CO.
XX
PI Oleson FB, Comereski CR;
XX
DR WPI; 1992-024199/03.
DR P-PSDB; AAR20124.
XX
PT Use of transforming growth factor (TGF)-beta and their
PT antagonists - for modulating blood pressure, for treating
PT hypertension and hypotension
XX
PS Disclosure; Fig 1; 42pp; English.
XX
CC A new method for treating hypertension comprises administering a
CC transforming growth factor (TGF)-beta to an individual at a dose
CC effective for lowering blood pressure; the TGF-beta may be e.g.
CC mature TGF-beta, TGF-beta2, a mature TGF-beta1/beta2 hybrid, TGF-
CC beta1 precursor, a latent TGF-beta2 precursor, hybrid TGF-beta1/TGF-
CC beta2 precursor, a latent TGF-beta1 complex or a latent TGF-beta2
CC complex.
CC (Updated on 25-MAR-2003 to correct PA field.)
XX
SQ Sequence 1559 BP; 300 A; 546 C; 446 G; 267 T; 0 other;

Query Match 85.9%; Score 311.8; DB 13; Length 1559;
Best Local Similarity 95.0%; Pred. No. 2.3e-64;
Matches 322; Conservative 0; Mismatches 17; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1154

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||||||||||||||||||||| |||||||| ||||| ||||||||||||
Db       1215 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1274

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db       1275 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1334

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1335 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1394

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| |||||||| |||||||||
Db       1395 CTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1433



RESULT 5
AAQ03268
ID AAQ03268 standard; DNA; 1561 BP.
XX
AC AAQ03268;
XX
DT 25-MAR-2003 (updated)
DT 12-AUG-1990 (first entry)
XX
DE Simian transforming growth factor-beta cDNA.
XX
KW Transforming growth factor-beta; psoriasis; TGF-beta; ss.
XX
OS Monkey.
XX
FH Key Location/Qualifiers
FT sig_peptide 283..324
FT /*tag= a
FT mat_peptide 1096..1431
FT /*tag= b
FT /product= human transforming growth factor
XX
PN EP353772-A.
XX
PD 07-FEB-1990.
XX
PF 04-AUG-1989; 89EP-0114458.
XX
PR 05-AUG-1988; 88US-0229133.
XX
PA (ONCO ) ONCOGEN LP.
XX
PI Twardzik DR, Purchio AF, Ranchalis JE, Stevens V;
XX
DR WPI; 1990-038499/06.
DR P-PSDB; AAR03743.
XX
PT Inhibition of proliferation of epidermal cells -
PT used to treat psoriasis by contacting cells with compositions
PT containing transforming growth factor-beta.
XX
PS Disclosure; fig 1; 20pp; English.
XX
CC TGF-beta may be used in the treatment of hyperplasia
CC associated with acanthosis-categorised skin diseases, and
CC in alleviating psoriatic symptoms associated with cytokine-
CC induced phenomena. See also AAQ03269 and AAR03750.
CC (Updated on 25-MAR-2003 to correct PA field.)
XX
SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 Other;

Query Match 85.9%; Score 311.8; DB 11; Length 1561;
Best Local Similarity 95.0%; Pred. No. 2.3e-64;
Matches 322; Conservative 0; Mismatches 17; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1096 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1155

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              || ||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db       1156 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1215

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||||||||||||||||||||| |||||||| ||||| ||||||||||||
Db       1216 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1275

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db       1276 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1335

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1336 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1395

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| |||||||| |||||||||
Db       1396 CTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1434



RESULT 6 
AAQ03269 

ID AAQ03269 standard; DNA; 1571 BP.
XX
AC AAQ03269;
XX
DT 25-MAR-2003 (updated)
DT 12-AUG-1990 (first entry)
XX
DE Human transforming growth factor-beta cDNA.
XX
KW Transforming growth factor-beta; psoriasis;
KW TGF-beta.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT sig_peptide 22..63
FT /*tag= a
FT mat_peptide 836..1170
FT /*tag= b
FT /product= human transforming growth factor-beta
XX
PN EP353772-A.
XX
PD 07-FEB-1990.
XX
PF 04-AUG-1989; 89EP-0114458.
XX
PR 05-AUG-1988; 88US-0229133.
XX
PA (ONCO ) ONCOGEN LP.
XX
PI Twardzik DR, Purchio AF, Ranchalis JE, Stevens V;
XX
DR WPI; 1990-038499/06.
DR P-PSDB; AAR03750.
XX
PT Inhibition of proliferation of epidermal cells -
PT used to treat psoriasis by contacting cells with compositions
PT containing transforming growth factor-beta.
XX
PS Disclosure; fig 1; 20pp; English.
XX
CC TGF-beta may be used in the treatment of hyperplasia
CC associated with acanthosis-categorised skin diseases, and
CC in alleviating psoriatic symptoms associated with cytokine-
CC induced phenomena. See also AAQ03268 and AAR03743.
CC (Updated on 25-MAR-2003 to correct PA field.)
XX
SQ Sequence 1571 BP; 299 A; 563 C; 443 G; 266 T; 0 Other;

Query Match 85.9%; Score 311.8; DB 11; Length 1571;
Best Local Similarity 95.0%; Pred. No. 2.3e-64;
Matches 322; Conservative 0; Mismatches 17; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1106 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1165

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              || ||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db       1166 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1225

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||||||||||||||||||||| |||||||| ||||| ||||||||||||
Db       1226 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1285

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db       1286 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1345

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1346 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1405

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| |||||||| |||||||||
Db       1406 CTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1444

RESULT 7 
AAT42771 

ID AAT42771 standard; cDNA; 339 BP.
XX 

AC AAT42771; 

XX 

DT 26-AUG-1997 (first entry) 
XX 

DE TGF-beta1 active fragment of a TGF-beta fusion protein encoding cDNA.

XX 

KW Transforming growth factor-beta fusion protein; wound healing; 

KW artificial skin; surgery recovery time; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 1..339
FT /*tag= a
FT /function= TGF active fragment

XX 

PN WO9639430-A1. 

XX 

PD 12-DEC-1996. 
XX 

PF 05-JUN-1996; 96WO-US08973.
XX

PR 06-JUN-1995; 95US-0470837.
XX 

PA (CHEU/) CHEUNG D T. 
PA (HALL/) HALL F L. 
PA (NIMN/) NIMNI M E. 
PA (TUAN/) TUAN T. 
PA (WULL/) WU L. 
XX 

PI Cheung DT, Hall FL, Nimni ME, Tuan T, Wu L; 
XX 

DR WPI; 1997-043065/04. 
DR P-PSDB; AAW08173. 
XX 

PT Prepn. of transforming growth factor-beta fusion protein - useful to 

PT reduce surgery recovery time and to prepare artificial skin 

XX 

PS Disclosure; Page 44-45; 59pp; English. 
XX 

CC A novel transforming growth factor-beta (TGF-beta) fusion protein
CC comprises a purification tag and a TGF active fragment. The present
CC sequence encodes a specifically claimed TGF active fragment, TGF-beta1.
CC Additionally, the fusion protein may comprise proteinase-sensitive
CC linker sites and binding domain so the protein sequence may contain
CC some or all of the following elements: purification tag: proteinase
CC site:ECM binding site:proteinase site:TGF-beta. TGF-beta promotes
CC wound healing, and the fusion protein can be used to reduce surgery
CC recovery time and in the preparation of artificial skin. The inclusion
CC of a purification tag facilitates purification of the fusion protein.
CC The proteinase site is included to permit cleavage and release of the
CC purification tag after purification if desired. The extracellular
CC matrix binding site facilitates delivery of the fusion protein to the
CC desired site of action. Delivery of the TGF-beta to the site to be
CC treated reduces the amount of TGF-beta required to be administered to
CC be effective and reduces the concentration of circulating TGF-beta
CC which may result in undesirable effects.
XX 

SQ Sequence 339 BP; 66 A; 113 C; 100 G; 60 T; 0 other; 

Query Match 85.5%; Score 310.2; DB 18; Length 339;
Best Local Similarity 94.7%; Pred. No. 4.4e-64;
Matches 321; Conservative 0; Mismatches 18; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db          1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              |||||||||||||| |||||||| |||||||||||||| ||||| |||||||||||||||
Db         61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCATGAGCCCAAGGGCTACCAT 120

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db        121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db        181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db        241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| ||||||||||||||||||
Db        301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339



RESULT 8 

AAV99375

ID AAV99375 standard; cDNA; 339 BP. 
XX 

AC AAV99375; 
XX 

DT 25-MAR-1999 (first entry) 
XX 

DE cDNA encoding a transforming growth factor beta active fragment. 
XX 

KW Proteinase site; bone morphogenetic fusion protein; bone binding site; 

KW bone morphogenetic protein; transforming growth factor beta; 

KW active fragment; wound healing; bone growth; purification tag; ds. 

XX 

OS Homo sapiens . 



XX 

PN WO9855137-A1.

XX

PD 10-DEC-1998.
XX

PF 02-JUN-1998; 98WO-US11189.
XX

PR 03-JUN-1997; 97US-0868452.
XX 

PA (HALL/) HALL F L. 

PA (HANB/) HAN B. 

PA (NIMN/) NIMNI M E. 

PA (SHOR/) SHORS E C. 

PA (WULL/) WU L. 

XX 

PI Hall FL, Han B, Nimni ME, Shors EC, Wu L; 
XX 

DR WPI; 1999-059875/05. 

DR P-PSDB; AAW84207.
XX

PT New bone morphogenetic fusion proteins - comprising a purification

PT tag and a bone morphogenetic active fragment, used for enhancing

PT wound healing or bone growth 
XX 

PS Example 1; Page 41; 64pp; English. 
XX 

CC The present sequence encodes a transforming growth factor beta active 

CC fragment. The protein can be used in place of a bone morphogenetic

CC active fragment to create the fusion proteins of the invention. When a 

CC bone morphogenetic active fragment is used, the fusion proteins are 

CC designated bone morphogenetic fusion proteins. The bone morphogenetic 

CC fusion protein may contain some or all of the following elements: a 

CC purification tag, a proteinase site, an ECM/bone binding site, a second 

CC proteinase site, and a bone morphogenetic protein active fragment. 

CC The bone morphogenetic fusion proteins can be used for enhancing wound 

CC healing or bone growth. 

XX 

SQ Sequence 339 BP; 66 A; 113 C; 100 G; 60 T; 0 other; 

Query Match 85.5%; Score 310.2; DB 20; Length 339;
Best Local Similarity 94.7%; Pred. No. 4.4e-64;
Matches 321; Conservative 0; Mismatches 18; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
Db

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db        121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db        181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db        241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| ||||||||||||||||||
Db        301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339



RESULT 9
AAN81084
ID AAN81084 standard; cDNA; 1560 BP.
XX
AC AAN81084;
XX
DT 25-MAR-2003 (updated)
DT 09-OCT-1990 (first entry)
XX
DE Coding sequence of simian transforming growth factor
XX
KW Transforming growth factor-beta 1; tumour treatment;
XX
OS Cercopithecus aethiops.
XX
FH Key Location/Qualifiers
FT CDS 261..1433
FT /*tag= a
FT sig_peptide 282..323
FT /*tag= b
FT mat_peptide 1095..1433
FT /*tag= c
XX
PN EP293785-A.
XX
PD 07-DEC-1988.
XX
PF 27-MAY-1988; 88EP-0108528.
XX
PR 29-MAY-1987; 87US-0055662.
PR 25-JAN-1988; 88US-0147842.
XX
PA (ONCO ) ONCOGEN.
PA (BRIM ) BRISTOL-MYERS CO.
XX
PI Purchio AG, Gentry L, Twardzik D;
XX
DR WPI; 1986-347488/49.
DR P-PSDB; AAP80647.
XX
PT Prodn. of simian transforming growth factor beta-1 - by culturing
PT transfected eucaryotic cells, and new precursor prot
PT treating tumours.
XX
PS Disclosure; Page ?; pp; English.
XX
CC The cDNA is prepd. from African green monkey cell line BSC-40 and is
CC expressed in eukaryotic cells in plasmid pSV2. There is 100% homology
CC between mature simian and human TGF-beta 1. The plasmid also contains
CC the SV40 promoter and a selection marker, esp. DHFR.
CC (Updated on 25-MAR-2003 to correct PA field.)
CC (Updated on 25-MAR-2003 to correct PI field.)
XX
SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other;

Query Match 85.5%; Score 310.2; DB 9; Length 1560;
Best Local Similarity 94.7%; Pred. No. 5.6e-64;
Matches 321; Conservative 0; Mismatches 18; Indels 0; Gaps 0;

Qy         25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
              |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1154

Qy         85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
              || ||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db       1155 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1214

Qy        145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
              ||||| |||||||||||||||||||||||||| |||||||| ||||| ||||||||||||
Db       1215 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1274

Qy        205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
              |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db       1275 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1334

Qy        265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1335 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1394

Qy        325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
              |||||||||||||||||||| ||||| || |||||||||
Db       1395 CTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433



RESULT 10 
AAQ03508 

ID AAQ03508 standard; DNA; 1560 BP. 
XX 

AC AAQ03508; 
XX 

DT 25-MAR-2003 (updated) 

DT 09-JAN-2003 (updated) 

DT 14-AUG-1990 (first entry) 
XX 

DE Simian Transforming growth factor - Beta1.
XX

KW HIV; AIDS; SIV; vaccine; AZT; CD4; Cytokines; growth

KW factors; ds.

XX 

OS Cebus apella. 
XX 

FH Key Location/Qualifiers 

FT CDS 267..1437
FT /*tag= a
FT mat_peptide 1103..1437
FT /*tag= b

XX 

PN EP356935-A. 
XX 

PD 07-MAR-1990. 

XX 

PF 25-AUG-1989; 89EP-0115719.
XX

PR 25-AUG-1988; 88US-0236698.
XX 

PA (ONCO ) ONCOGEN LP. 
XX 

PI Brankovan V, Lioubin M, Purchio A; 
XX 

DR WPI; 1990-068723/10. 
DR P-PSDB; AAR05663. 
XX 

PT Compsns. contg. transforming growth factor beta -

PT used for inhibitions of HIV infection and replication in vivo. 

XX 

PS Disclosure; Fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in vivo to prevent formation of syncytia and

CC inhibit HIV infection. TGF may also be used with other HIV treatments 

CC (AZT, soluble CD4 etc.). 

CC (Updated on 09-JAN-2003 to add missing OS field.) 
CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 85.5%; Score 310.2; DB 11; Length 1560;

Best Local Similarity 94.7%; Pred. No. 5.6e-64; 

Matches 321; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1095 GCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1154

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
        || ||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db 1155 TATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1214

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
        ||||| |||||||||||||||||||||||||| |||||||| ||||| ||||||||||||
Db 1215 GCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1274

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 1275 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1334

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1335 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1394

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||| || |||||||||
Db 1395 CTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433



RESULT 11 
AAT17235 

ID AAT17235 Standard; cDNA to mRNA; 339 BP. 
XX 

AC AAT17235;
XX 

DT 17-JUL-1996 (first entry) 
XX 

DE Human TGF-beta 1 cDNA. 
XX 

KW Transforming growth factor type beta; TGF-beta 1; 

KW protein renaturation; protein folding; ds.

XX 

OS Homo sapiens. 
XX 

PN WO9603433-A1. 
XX 

PD 08-FEB-1996. 

XX 

PF 12-JUL-1995; 95WO-EP02719 . 
XX 

PR 25-JUL-1994; 94EP-0810439 . 
XX 

PA (CIBA ) CIBA GEIGY AG. 
XX 

PI Cerletti N; 

XX 

DR WPI; 1996-117000/12. 

DR P-PSDB; AAR92773.
XX 

PT Prodn. of dimeric biologically active transforming growth factor 

PT by refolding denatured monomer in detergent - free folding buffer 

PT contg. specific organic solvent to improve yield 
XX 

PS Example IB; Page 20-30; 54pp; English. 
XX 

CC The coding sequence (AAT17235) of human transforming growth factor 

CC TGF-beta 1 (AAR92773) was cloned into plasmid pGEM-5ZF (+) (Promega) 

CC and the construct used to transform E. coli Y1090. Subcloning in 

CC pPLMu yielded plasmid pPLMu.hTGF-beta 1. Non-soluble, monomeric

CC TGF-beta 1 was recovered from E. coli LC137/pPLMu.hTGF-beta 1 (DSM

CC 5656) transformants. A biologically active, dimeric form of

CC TGF-beta 1 was obtd. by refolding this monomer in detergent-free

CC buffer contg. DMSO and/or DMF. Dimers of TGF-beta 3 (AAR92772) and

CC TGF-beta 2 (AAR92774), and hybrid dimers (see also AAR92775-77), were

CC also produced. 
XX 

SQ Sequence 339 BP; 66 A; 114 C; 100 G; 59 T; 0 other; 

Query Match 85.0%; Score 308.6; DB 17; Length 339; 

Best Local Similarity 94.4%; Pred. No. 1.1e-63;

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 



Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db    1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
        |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db   61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
        ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db  121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db  181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db  301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339

RESULT 12 
AAX15245 



ID AAX15245 Standard; cDNA; 339 BP. 
XX 

AC AAX15245; 
XX 

DT 20-MAR-2003 (updated) 

DT 28-APR-1999 (first entry) 

XX 

DE cDNA encoding the mature form of transforming growth factor-beta-1.
XX

KW Transforming growth factor-beta-1; TGF-beta-like protein;

KW S-sulphonated TGF-beta-like protein; wound treatment; cancer; 

KW bone repair; tissue repair; bone marrow protective agent; 

KW cardioprotection; anti-inflammatory; immunosuppressive;

KW ulcer; bed sore; ds. 

XX 

OS Homo sapiens. 
XX 

PN EP891985-A1. 
XX 

PD 20-JAN-1999. 
XX 

PF 27-NOV-1990; 98EP-0113487.
XX

PR 06-DEC-1989; 89GB-0027546.
PR 27-NOV-1990; 90EP-0810922.
XX 

PA (NOVS ) NOVARTIS AG. 
XX 

PI Cerletti N, Cox D, McMaster GK, Meyhack B, Schmitz A;
XX 



DR WPI; 1999-083520/08. 

DR P-PSDB; AAW97091. 
XX 

PT Producing biologically active dimeric Transforming Growth 

PT Factor-beta - by refolding new monomeric Transforming Growth 

PT Factor-beta, useful for treatment of wounds and cancer 
XX 

PS Example 1; Page 28; 32pp; English. 
XX 

CC The present sequence encodes the mature form of transforming growth 

CC factor-beta-1. Dimeric, biologically active TGF-beta-like protein

CC can be produced by subjecting the denatured monomeric form to refolding 

CC conditions. The new monomeric S-sulphonated TGF-beta-like protein is 

CC useful for the production of the dimeric, biologically active 

CC TGF-beta-like protein, which is useful for the treatment of wounds 

CC (surface or internal) and cancer in a mammal, in bone and tissue 

CC repair, as a bone marrow protective agent, a mediator of 

CC cardioprotection, for the production of an anti-inflammatory or

CC immunosuppressive preparation. Treatment is useful for animals, 

CC especially humans, and wound treatment (e.g. ulcers, bed sores etc.) is 

CC particularly useful for the elderly. 

CC (Updated on 20-MAR-2003 to correct PF field.) 

CC (Updated on 20-MAR-2003 to correct PR field.) 

XX 

SQ Sequence 339 BP; 66 A; 114 C; 100 G; 59 T; 0 other; 

Query Match 85.0%; Score 308.6; DB 20; Length 339; 

Best Local Similarity 94.4%; Pred. No. 1.1e-63;

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db    1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
        |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db   61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
        ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db  121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db  181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db  301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339



RESULT 13 
ABK84023 



ID ABK84023 standard; cDNA; 650 BP. 
XX 

AC ABK84023;
XX 

DT 14-AUG-2002 (first entry) 
XX 

DE Human cDNA differentially expressed in granulocytic cells #594. 
XX 

KW Human; ss; granulocytic cell; DNA chip; bacterial infection; 

KW viral infection; parasitic infection; protozoal infection; 

KW fungal infection; sterile inflammatory disease; psoriasis; 

KW rheumatoid arthritis; glomerulonephritis; asthma; thrombosis; 

KW cardiac reperfusion injury; renal reperfusion injury; ARDS; 

KW adult respiratory distress syndrome; inflammatory bowel disease; 

KW Crohn's disease; ulcerative colitis; periodontal disease; 

KW granulocyte activation; chronic inflammation; allergy.

XX

OS Homo sapiens.
XX

PN WO200228999-A2.
XX

PD 11-APR-2002.
XX

PF 03-OCT-2001; 2001WO-US30821.
XX

PR 03-OCT-2000; 2000US-237189P.
XX 

PA (GENE-) GENE LOGIC INC. 
XX 

PI Beazer-Barclay Y, Weissman SM, Yamaga S, Vockley J; 
XX 

DR WPI; 2002-435328/46. 
XX 

PT Detecting granulocyte activation by detecting differential expression 

PT of genes associated with granulocyte activation, which serves as 

PT diagnostic markers that is useful for monitoring disease states and 

PT drug toxicity - 
XX 

PS Claim 1; SEQ ID No 594; 114pp; English. 
XX 

CC The invention relates to detecting (M1) granulocyte (GC) activation

CC (GCA), by detecting the level of expression of gene(s) (Gs) identified by

CC DNA chip analysis as given in the specification, and comparing 

CC the expression level to an expression level in an unactivated 

CC GC, where differential expression of Gs is indicative of GCA. 

CC Also included are modulating (M2) GA by contacting GC with an agent 

CC that alters the expression of at least one gene in Gs; (2) screening (M3) 

CC for an agent capable of modulating GCA or an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease using the 

CC gene expression profile; (3) detecting (M4) an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease, by detecting the 

CC level of expression in a sample of the tissue of gene(s) from Gs, where 

CC the level of expression of the gene is indicative of inflammation; 

CC (4) treating (M5) an inflammation (especially chronic) or in a tissue, 

CC an allergic response in a subject, exposure of a subject to a pathogen 



CC or sterile inflammatory disease, by contacting a tissue having 

CC inflammation with an agent that modulates the expression of gene(s) 

CC from Gs in the tissue. M1 is useful for detecting GCA; M2 is useful for

CC modulating GA; M3 is useful for screening an agent capable of modulating 

CC GCA preferably in an inflammation in a tissue; M4 is useful for 

CC detecting an inflammation (especially chronic) in a tissue, an allergic 

CC response in a subject, exposure of a subject to a pathogen or sterile 

CC inflammatory disease (e.g. psoriasis, rheumatoid arthritis, 

CC glomerulonephritis, asthma, thrombosis, cardiac reperfusion injury, renal 

CC reperfusion injury, ARDS, adult respiratory distress syndrome, 

CC inflammatory bowel disease, Crohn's disease, ulcerative colitis, 

CC periodontal disease; also bacterial infection, viral infection, 

CC parasitic infection, protozoal infection, fungal infection); and M5 is

CC useful for treating one of the above conditions. The present 

CC sequence represents a gene differentially expressed in granulocytes. 

CC Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic 

CC format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences.

XX 

SQ Sequence 650 BP; 108 A; 244 C; 179 G; 119 T; 0 other; 



Query Match 85.0%; Score 308.6; DB 24; Length 650; 

Best Local Similarity 94.4%; Pred. No. 1.2e-63;

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db    1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
        |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db   61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
        ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db  121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db  181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db  301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339



RESULT 14 
ABL68818 

ID ABL68818 standard; DNA; 650 BP. 
XX 

AC ABL68818; 
XX 



DT 15-MAY-2002 (first entry) 
XX 

DE Kidney cancer related gene sequence SEQ ID NO: 7155. 
XX 

KW Human; cancer; colon; breast; ovary; oesophagus; kidney; thyroid; 

KW stomach; lung; prostate; pancreas; carcinoma; antitumour; cancerous; 

KW cytostatic; gene therapy; antineoplastic; Wilm's tumour; adenocarcinoma; 

KW gene; ds. 

XX 

OS Homo sapiens . 
XX 

PN WO200194629-A2. 
XX 



PD 13-DEC-2001.
XX

PF 30-MAY-2001; 2001WO-US10B3B.
XX

PR 05-JUN-2000; 2000US-209473P.
PR 05-JUN-2000; 2000US-209531P.
PR 18-SEP-2000; 2000US-233133P.
PR 18-SEP-2000; 2000US-233617P.
PR 20-SEP-2000; 2000US-234009P.
PR 20-SEP-2000; 2000US-234034P.
PR 20-SEP-2000; 2000US-234052P.
PR 22-SEP-2000; 2000US-234509P.
PR 22-SEP-2000; 2000US-234567P.
PR 25-SEP-2000; 2000US-234923P.
PR 25-SEP-2000; 2000US-234924P.
PR 25-SEP-2000; 2000US-235077P.
PR 25-SEP-2000; 2000US-235082P.
PR 25-SEP-2000; 2000US-235134P.
PR 25-SEP-2000; 2000US-235280P.
PR 26-SEP-2000; 2000US-235637P.
PR 26-SEP-2000; 2000US-235638P.
PR 27-SEP-2000; 2000US-235711P.
PR 27-SEP-2000; 2000US-235720P.
PR 27-SEP-2000; 2000US-235840P.
PR 27-SEP-2000; 2000US-235863P.
PR 28-SEP-2000; 2000US-236028P.
PR 28-SEP-2000; 2000US-236032P.
PR 28-SEP-2000; 2000US-236033P.
PR 28-SEP-2000; 2000US-236034P.
PR 28-SEP-2000; 2000US-236109P.
PR 28-SEP-2000; 2000US-236111P.
PR 29-SEP-2000; 2000US-236842P.
PR 29-SEP-2000; 2000US-236891P.
PR 02-OCT-2000; 2000US-237172P.
PR 02-OCT-2000; 2000US-237173P.
PR 02-OCT-2000; 2000US-237278P.
PR 02-OCT-2000; 2000US-237294P.
PR 02-OCT-2000; 2000US-237295P.
PR 02-OCT-2000; 2000US-237316P.
PR 03-OCT-2000; 2000US-237425P.
PR 03-OCT-2000; 2000US-237598P.
PR 03-OCT-2000; 2000US-237604P.
PR 03-OCT-2000; 2000US-237606P.
PR 03-OCT-2000; 2000US-237608P.
PR 01-NOV-2000; 2000US-244867P.
PR 01-NOV-2000; 2000US-245084P.
XX 

PA (AVAL-) AVALON PHARM.
XX 

PI Young PE, Augustus M, Carter KC, Ebner Endress G, Horrigan S; 

PI Soppet DR, Weaver Z; 

XX 

DR WPI; 2002-188264/24. 
XX 

PT Screening for anti-neoplastic agent involves exposing cells to a

PT chemical agent to be tested for anti-neoplastic activity, and 

PT determining a change in expression of a gene of a signature gene set 

XX 

PS Claim 1; SEQ ID 7155; 44pp; English. 
XX 

CC The present invention describes a method (M1) for screening for an

CC anti-neoplastic agent. The method involves exposing cells to a chemical

CC agent to be tested for anti-neoplastic activity, determining a change in

CC expression of at least one gene (I) of a signature gene set, where (I)

CC comprises a sequence (S) selected from 8447 sequences (given in ABL61664

CC to ABL70110), or is at least 95% identical to (S), where a change in

CC expression is indicative of anti-neoplastic activity. (I) has cytostatic

CC activity and can be used in gene therapy. M1 can be used for screening

CC an anti-neoplastic agent, and can be used for producing a product which

CC is the data collected with respect to the anti-neoplastic agent as a

CC result of M1, and the data is sufficient to convey the chemical

CC structure and/or properties of the agent. M1 can be used in the

CC treatment of cancer such as colon, breast, stomach, lung, thyroid, 

CC oesophageal, ovarian, kidney, prostate or pancreatic cancer, 

CC adenocarcinoma, carcinoma, clear cell cancer, infiltrating ductal cancer, 

CC infiltrating lobular cancer, squamous cell carcinoma, neuroendocrine 

CC carcinoma, papillary carcinoma and Wilm's tumour. 

XX 

SQ Sequence 650 BP; 108 A; 244 C; 179 G; 119 T; 0 other; 

Query Match 85.0%; Score 308.6; DB 24; Length 650; 
Best Local Similarity 94.4%; Pred. No. 1.2e-63; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
Db

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
Db

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db  181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db  301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339



RESULT 15 

ABZ35738 

ID ABZ35738 Standard; DNA; 1176 BP. 
XX 

AC ABZ35738; 
XX 

DT 07-FEB-2003 (first entry) 
XX 

DE Human TGF beta 1 polynucleotide SEQ ID NO 46. 

XX 

KW Double stranded RNA; dsRNA; RNAi; RNA inhibition; cytostatic; virucide;

KW protozoacide; gene expression; antisense; tumour; infection; Plasmodium; 

KW virus; viroid; anti-GFP; human; HIV; human immunodeficiency virus; 

KW Hepatitis C virus; human papilloma virus; gene; ds. 
XX 

OS Homo sapiens. 
XX 

PN DE10100588-A1. 
XX 

PD 18-JUL-2002.
XX

PF 09-JAN-2001; 2001DE-1000588.
XX

PR 09-JAN-2001; 2001DE-1000588.
XX 

PA (RIBO-) RIBOPHARMA AG. 
XX 

PI Kreutzer R, Limmer S, Rost S, Hadwiger P; 
XX 

DR WPI; 2002-683450/74. 
XX 

PT Inhibiting expression of target genes, useful e.g. for treating tumors, 

PT by introducing into cells two double-stranded RNAs that are

PT complementary to the target 
XX 

PS Claim 13; Page 34-35; 100pp; German.
XX 

CC The invention relates to inhibiting expression of a target gene in a cell

CC by introducing at least two oligoribonucleotides (dsRNA I and II), both

CC with a double-stranded (ds) structure of at most 49 sequential nucleotide

CC pairs. At least part of one strand (S1, S2) of the ds structures in each

CC of dsRNA I and II are complementary to regions in the target gene. The

CC method uses antisense inhibition of gene expression using double stranded

CC RNA inhibition (RNAi). The method is particularly used to treat tumours

CC or infections, especially by Plasmodium or viruses/viroids (pathogenic on

CC humans, animals or plants). The method provides more effective inhibition

CC of expression than known methods using a single dsRNA, even at very low 

CC concentrations. When dsRNA has at least one unpaired nucleotide at the 

CC end, stability (and thus effective concentration in the cell) is 



CC improved and efficiency can be increased further by pretreating the cells 

CC with interferon. The present sequence is that of a target DNA of the 

CC invention. 
XX 

SQ Sequence 1176 BP; 239 A; 382 C; 353 G; 202 T; 0 other; 

Query Match 85.0%; Score 308.6; DB 24; Length 1176; 

Best Local Similarity 94.4%; Pred. No. 1.3e-63; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db  838 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 897

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
        |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db  898 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 957

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
        ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db  958 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1017

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 1018 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1077

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db 1078 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1137

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db 1138 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1176
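
The summary lines printed above each alignment can be cross-checked against the alignment itself. The sketch below is not part of the search output. It assumes a gap-free alignment (Indels 0, Gaps 0), a match score of 1 and a mismatch score of -0.6. The -0.6 figure is only an inference from the reported numbers (320 - 19 x 0.6 = 308.6), not a documented parameter of the scoring table, and the function names are hypothetical.

```python
def match_line(qy: str, db: str) -> str:
    """Return the '|' / ' ' marker row printed between a Qy row and a Db row."""
    if len(qy) != len(db):
        raise ValueError("gap-free rows must have equal length")
    return "".join("|" if a == b else " " for a, b in zip(qy.upper(), db.upper()))


def summary(matches: int, mismatches: int, perfect_score: float,
            mismatch_score: float = -0.6) -> dict:
    """Recompute Score, Query Match and Best Local Similarity (gap-free case)."""
    # Assumed scoring: +1 per match, mismatch_score per mismatch (see lead-in).
    score = matches + mismatches * mismatch_score
    return {
        "score": round(score, 1),
        "query_match_pct": round(100.0 * score / perfect_score, 1),
        "best_local_similarity_pct": round(100.0 * matches / (matches + mismatches), 1),
    }


# Last row of the RESULT 15 alignment (Qy 325-363 against Db 1138-1176).
QY = "CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA"
DB = "CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA"
print(match_line(QY, DB))
print(summary(320, 19, 363))
```

Under these assumptions the recomputed values agree with the printed Score, Query Match and Best Local Similarity for the entries with 320 matches and 19 mismatches, taking 363 as the perfect score.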



Search completed: October 27, 2003, 19:10:54 
Job time : 110.057 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 27, 2003, 18:35:27 ; Search time 107.894 Seconds
(without alignments)
9022.658 Million cell updates/sec



Title: US-10-017-372E-10
Perfect score: 363
Sequence: 1 gactacaaggatgacgacga gttcctgcaagtgcagctga 363



Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0



Searched: 1792395 seqs, 1340900451 residues



Total number of hits satisfying chosen parameters: 3584790 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0%

Maximum Match 100% 
Listing first 45 summaries 
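
The reported throughput is consistent with the other header figures if one dynamic-programming cell is counted per query residue per database residue, with the database scanned on both strands. The factor of two is an assumption, suggested by the hit count (3584790) being exactly twice the sequence count (1792395). The function name is hypothetical.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float, strands: int = 2) -> float:
    """Cells filled per second, in millions, for a full scan of the database."""
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6


# Figures taken from the run header: 363-residue query, 1340900451 residues,
# 107.894 seconds of search time.
rate = million_cell_updates_per_sec(363, 1_340_900_451, 107.894)
print(f"{rate:.1f} million cell updates/sec")  # header reports 9022.658
```

The small residual difference from the printed figure is within what rounding of the search time to three decimals would produce.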

Database : Published_Applications_NA:*
1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
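
The run header names a Smith-Waterman ("sw") model with the IDENTITY_NUC table, Gapop 10.0 and Gapext 1.0. As a rough illustration of how such a score is obtained, the sketch below computes the best local alignment score with affine gaps. It is not the search program's implementation: the match and mismatch values (1 and -0.6) are inferred from the reported scores, and the convention that the first gap residue costs Gapop and each further residue costs Gapext is an assumption.

```python
def smith_waterman_score(query: str, target: str,
                         match: float = 1.0, mismatch: float = -0.6,
                         gap_open: float = 10.0, gap_extend: float = 1.0) -> float:
    """Best local alignment score with affine gaps (Gotoh-style recurrences)."""
    query, target = query.upper(), target.upper()
    n = len(target)
    neg_inf = float("-inf")
    h_prev = [0.0] * (n + 1)      # best score ending at (previous row, j)
    f = [neg_inf] * (n + 1)       # best score ending in a gap that skips query residues
    best = 0.0
    for q in query:
        h_cur = [0.0] * (n + 1)
        e = neg_inf               # best score ending in a gap that skips target residues
        for j in range(1, n + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f[j] = max(h_prev[j] - gap_open, f[j] - gap_extend)
            sub = match if q == target[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + sub, e, f[j])
            best = max(best, h_cur[j])
        h_prev = h_cur
    return best


# Final 39 residues of the query against a subject row differing at one position.
score = smith_waterman_score("CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA",
                             "CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA")
print(round(score, 1))
```

With a gap-open cost this large relative to the mismatch penalty, near-identical sequences align without gaps, which matches the "Indels 0; Gaps 0" lines in the results.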
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ALIGNMENTS 



RESULT 1 

US-09-911-904-167 

; Sequence 167, Application US/09911904
; Publication No. US20030096234A1
; GENERAL INFORMATION:
; APPLICANT: Farr, Spencer B.
; APPLICANT: Pickett, Gavin G.
; APPLICANT: Neft, Robin Eileen
; APPLICANT: Dunn, II, Robert Thomas
; TITLE OF INVENTION: CANINE TOXICITY GENES
; FILE REFERENCE: 400742000200
; CURRENT APPLICATION NUMBER: US/09/911,904
; CURRENT FILING DATE: 2002-04-09
; PRIOR APPLICATION NUMBER: US 60/220,057
; PRIOR FILING DATE: 2000-07-21
; NUMBER OF SEQ ID NOS: 386
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 167
; LENGTH: 489
; TYPE: DNA
; ORGANISM: Canis familiaris
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: (1)...(489)
; OTHER INFORMATION: n = A, T, C or G
US-09-911-904-167 



Query Match 86.2%; Score 312.8; DB 11; Length 489; 

Best Local Similarity 95.0%; Pred. No. 2.9e-84;

Matches 323; Conservative 0; Mismatches 17; Indels 0; Gaps 0; 



Qy   24 GGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCT 83
        ||||||||| ||||||||||||||||||||||||||||||||||||||||| ||||||||
Db   71 GGCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCT 130

Qy   84 CTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA 143
        ||||||||||||||| ||||| ||||||||||||||||| ||||| |||||||| |||||
Db  131 CTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCA 190

Qy  144 TGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAA 203
         || || ||||||||||||||||| |||||||| |||||||| ||||| |||||||||||
Db  191 CGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAA 250

Qy  204 GGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 263
        ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db  251 GGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 310

Qy  264 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 323
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 370

Qy  324 GCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||| |||||||||||||| ||||||||||||||||||
Db  371 GCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 410





RESULT 2 

US-09-813-271B-1 

; Sequence 1, Application US/09813271B
; Patent No. US20020115834A1
; GENERAL INFORMATION:
; APPLICANT:
; (A) Nico Cerletti
; TITLE OF INVENTION: New process for the production of
; biologically active protein
; NUMBER OF SEQUENCES: 13
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Novartis Patent Department
; STREET: 564 Morris Avenue
; CITY: Summit
; STATE: New Jersey
; COUNTRY: USA
; ZIP: 07901
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/813,271B
; FILING DATE: 20-Mar-2001
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: PCT/EP95/02719
; FILING DATE: 12-Jul-95
; APPLICATION NUMBER: EPO 94810439.3
; FILING DATE: 25-Jul-94
; ATTORNEY/AGENT INFORMATION:
; NAME: Pfeiffer, Hesna J.
; REGISTRATION NUMBER: 22640
; REFERENCE/DOCKET NUMBER: 4-20039C/C1C1/USN
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (908) 522-6940
; TELEFAX: (908) 522-6955
; INFORMATION FOR SEQ ID NO: 1:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 339 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: double
; TOPOLOGY: linear
; MOLECULE TYPE: cDNA to mRNA
; HYPOTHETICAL: NO
; IMMEDIATE SOURCE:
; CLONE: E. coli LC137/pPLMu.hTGF-beta1 (DSM 5656)
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 1..336
; OTHER INFORMATION: /product = "human TGF-beta1"
; SEQUENCE DESCRIPTION: SEQ ID NO: 1:
US-09-813-271B-1 

Query Match 85.0%; Score 308.6; DB 10; Length 339; 

Best Local Similarity 94.4%; Pred. No. 4.9e-83; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db    1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
        |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db   61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
        ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db  121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
        |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db  181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 240

Qy  265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 300

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db  301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339



RESULT 3 
US-10-087-268-1 



; Sequence 1, Application US/10087268
; Publication No. US20030119010A1
; GENERAL INFORMATION:
; APPLICANT: Jonsonn, Julie Ruth
; APPLICANT: Powell, Elizabeth Ellen
; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or
; condition
; FILE REFERENCE: Fibrosis
; CURRENT APPLICATION NUMBER: US/10/087,268
; CURRENT FILING DATE: 2002-03-01
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 1
; LENGTH: 1821
; TYPE: DNA
; ORGANISM: Human
; FEATURE:
; NAME/KEY: 5'UTR
; LOCATION: (1)..(511)
; OTHER INFORMATION:
; NAME/KEY: CDS
; LOCATION: (512)..(1684)
; OTHER INFORMATION:
; NAME/KEY: sig_peptide
; LOCATION: (512)..(598)
; OTHER INFORMATION:
; NAME/KEY: 3'UTR
; LOCATION: (1685)..(1821)
; OTHER INFORMATION:
US-10-087-268-1 

Query Match 85.0%; Score 308.6; DB 14; Length 1821;

Best Local Similarity 94.4%; Pred. No. 6.9e-83; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy   25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
        |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db 1346 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1405

Qy   85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db

Qy  145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
Db

Qy  205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
Db

Qy
Db

Qy  325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
        |||||||||||||||||||| ||||||||||||||||||
Db 1646 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684



RESULT 4 
US-10-087-268-4 

; Sequence 4, Application US/10087268
; Publication No. US20030119010A1
; GENERAL INFORMATION:
; APPLICANT: Jonsonn, Julie Ruth
; APPLICANT: Powell, Elizabeth Ellen
; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or
; condition
; FILE REFERENCE: Fibrosis
; CURRENT APPLICATION NUMBER: US/10/087,268
; CURRENT FILING DATE: 2002-03-01
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 4
; LENGTH: 1821
; TYPE: DNA
; ORGANISM: Human
; FEATURE:
; NAME/KEY: 5'UTR
; LOCATION: (1)..(511)
; OTHER INFORMATION:
; NAME/KEY: CDS
; LOCATION: (512)..(1684)
; OTHER INFORMATION:
; NAME/KEY: sig_peptide
; LOCATION: (512)..(598)
; OTHER INFORMATION:
; NAME/KEY: 3'UTR
; LOCATION: (1685)..(1821)
; OTHER INFORMATION:
US-10-087-268-4 

Query Match 85.0%; Score 308.6; DB 14; Length 1821;

Best Local Similarity 94.4%; Pred. No. 6.9e-83; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy     25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
          |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1346 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1405

Qy     85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db   1406

Qy    145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
Db   1466

Qy    205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
Db   1526

Qy    265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
          |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db   1586 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1645

Qy    325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
          |||||||||||||||||||| ||||||||||||||||||
Db   1646 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684
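
The percentages printed for each hit can be reproduced from the other figures on the same lines. The sketch below is a cross-check, not the search program's own code: it assumes Query Match is the score divided by the perfect score (taken here as 363, the final Qy coordinate and the query length) and that Best Local Similarity is matches divided by aligned columns. The function names are illustrative.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the best possible score (assumed definition)."""
    return 100.0 * score / perfect_score


def local_similarity_pct(matches: int, mismatches: int, indels: int = 0) -> float:
    """Identical columns as a percentage of all aligned columns (assumed definition)."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


# Figures reported for RESULT 4 (US-10-087-268-4).
qm = query_match_pct(308.6, 363)
sim = local_similarity_pct(320, 19)
print(f"Query Match {qm:.1f}%; Best Local Similarity {sim:.1f}%")
```

The same two formulas also reproduce the percentages of the other hits in this listing, for example 287.8 / 363 and 307 / 339 for the Rattus norvegicus hit.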



RESULT 5 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT: Xue, Aidong J. 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou, Ping 
APPLICANT: Ma, Yunqing 
APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: Tillinghast, John 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: Novel Nucleic Acids and
TITLE OF INVENTION: Polypeptides
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/10/037,270 
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS: 1104
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 220 
LENGTH: 2742 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (842)..(2014)
US-10-037-270-220 

Query Match 85.0%; Score 308.6; DB 14; Length 2742;
Best Local Similarity 94.4%; Pred. No. 7.5e-83;
Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0;

Qy     25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
          |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1676 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1735

Qy     85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
          |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db   1736 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1795

Qy    145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
          ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db   1796 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1855

Qy    205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
          |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db   1856 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1915

Qy    265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
          |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db   1916

Qy    325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
          |||||||||||||||||||| ||||||||||||||||||
Db   1976 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014



RESULT 6 

US-09-948-002-28 

; Sequence 28, Application US/09948002
; Publication No. US20030050265A1
; GENERAL INFORMATION:
; APPLICANT: Nicholas M. Dean
; APPLICANT: Susan F. Murray
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH
; TITLE OF INVENTION: FACTOR BETA EXPRESSION
; FILE REFERENCE: ISPH-0607
; CURRENT APPLICATION NUMBER: US/09/948,002
; CURRENT FILING DATE: 2000-09-05
; PRIOR APPLICATION NUMBER: 09/661,753
; PRIOR FILING DATE: 2000-09-14
; PRIOR APPLICATION NUMBER: 60/154,546
; PRIOR FILING DATE: 1999-09-17
; NUMBER OF SEQ ID NOS: 71
; SEQ ID NO 28
; LENGTH: 2745
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (842)...(2017)
US-09-948-002-28 

Query Match 85.0%; Score 308.6; DB 11; Length 2745; 

Best Local Similarity 94.4%; Pred. No. 7.5e-83; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy     25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
          |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db   1679 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 1738

Qy     85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
          |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db   1739 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1798

Qy    145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
          ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db   1799 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 1858

Qy    205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
          |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db   1859 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 1918

Qy    265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
          |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db   1919 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1978

Qy    325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
          |||||||||||||||||||| ||||||||||||||||||
Db   1979



RESULT 7 

US-09-948-002-27 

Sequence 27, Application US/09948002 
Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/09/948,002
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS: 71
SEQ ID NO 27 
LENGTH: 1585 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE:
NAME/KEY: CDS
LOCATION: (413)...(1585)
US-09-948-002-27 

Query Match 79.3%; Score 287.8; DB 11; Length 1585; 

Best Local Similarity 90.6%; Pred. No. 1.2e-76;

Matches 307; Conservative 0; Mismatches 32; Indels 0; Gaps 0; 
Qy     25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
          |||||||||||||||||||||||||||||||| |||||||||||||| || ||||||||
Db   1247 GCCCTGGATACCAACTACTGCTTCAGCTCCACAGAGAAGAACTGCTGTGTACGGCAGCTG 1306

Qy     85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
          |||||||||||  ||||||||||||| ||||||||||| || || |||||||||||||||
Db   1307 TACATTGACTTTAGGAAGGACCTGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 1366

Qy    145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
Db   1367

Qy    205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
Db   1427

Qy    265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
Db   1487

Qy    325
Db   1547




RESULT 8 
US-09-948-002-1 

; Sequence 1, Application US/09948002
; Publication No. US20030050265A1
; GENERAL INFORMATION:
; APPLICANT: Nicholas M. Dean
; APPLICANT: Susan F. Murray
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH
; TITLE OF INVENTION: FACTOR BETA EXPRESSION
; FILE REFERENCE: ISPH-0607
; CURRENT APPLICATION NUMBER: US/09/948,002
; CURRENT FILING DATE: 2000-09-05
; PRIOR APPLICATION NUMBER: 09/661,753
; PRIOR FILING DATE: 2000-09-14
; PRIOR APPLICATION NUMBER: 60/154,546
; PRIOR FILING DATE: 1999-09-17
; NUMBER OF SEQ ID NOS: 71
; SEQ ID NO 1
; LENGTH: 2094
; TYPE: DNA
; ORGANISM: Mus musculus
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (868)...(2040)
US-09-948-002-1 

Query Match 79.3%; Score 287.8; DB 11; Length 2094;

Best Local Similarity 90.6%; Pred. No. 1.3e-76; 

Matches 307; Conservative 0; Mismatches 32; Indels 0; Gaps 0; 
Qy     25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
          ||||||||||||||||| |||||||||||||| |||||||||||||| |||||||||||
Db   1702 GCCCTGGATACCAACTATTGCTTCAGCTCCACAGAGAAGAACTGCTGTGTGCGGCAGCTG 1761

Qy     85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db   1762

Qy    145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
          ||||| ||||| ||||| ||||| ||||| || |||||||| ||||| ||||||||||||
Db   1822 GCCAACTTCTGTCTGGGACCCTGCCCCTATATTTGGAGCCTGGACACACAGTACAGCAAG 1881

Qy    205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
          ||||| || || |||||||| |||||||||||||| |||||| | |||||||||||||||
Db   1882 GTCCTTGCCCTCTACAACCAACACAACCCGGGCGCTTCGGCGTCACCGTGCTGCGTGCCG 1941

Qy    265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
          |||||  ||||||||||||||||||| ||||||||||| |||||||||||||||||||||
Db   1942 CAGGCTTTGGAGCCACTGCCCATCGTCTACTACGTGGGTCGCAAGCCCAAGGTGGAGCAG 2001

Qy    325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
           ||||||||||||| ||||| ||||||||||||||||||
Db   2002 TTGTCCAACATGATTGTGCGCTCCTGCAAGTGCAGCTGA 2040



RESULT 9 

US-09-813-271B-7 

; Sequence 7, Application US/09813271B
; Patent No. US20020115834A1
; GENERAL INFORMATION:
; APPLICANT:
; (A) Nico Cerletti
; TITLE OF INVENTION: New process for the production of
; biologically active protein
; NUMBER OF SEQUENCES: 13
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Novartis Patent Department
; STREET: 564 Morris Avenue
; CITY: Summit
; STATE: New Jersey
; COUNTRY: USA
; ZIP: 07901
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/813,271B
; FILING DATE: 20-Mar-2001
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: PCT/EP95/02719
; FILING DATE: 12-Jul-95
; APPLICATION NUMBER: EPO 94810439.3
; FILING DATE: 25-Jul-94
; ATTORNEY/AGENT INFORMATION:
; NAME: Pfeiffer, Hesna J.
; REGISTRATION NUMBER: 22640
; REFERENCE/DOCKET NUMBER: 4-20039C/C1C1/USN
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (908) 522-6940
; TELEFAX: (908) 522-6955
; INFORMATION FOR SEQ ID NO: 7:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 336 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: double
; TOPOLOGY: linear
; MOLECULE TYPE: Other nucleic acid
; DESCRIPTION: /desc = "recombinant hybrid DNA of
; IMMEDIATE SOURCE:
; CLONE: E. coli LC137/pPLMu.TGF-beta1(44/45)beta3
; FEATURE:
; NAME/KEY: mat_peptide
; LOCATION: 1..132
; OTHER INFORMATION: /product = "N-terminal 44 amino
; acids of human TGF-beta1"
; FEATURE:
; NAME/KEY: mat_peptide
; LOCATION: 133..336
; OTHER INFORMATION: /product = "C-terminal 68 amino
; acids of human TGF-beta3"
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 1..336
; OTHER INFORMATION: /product= "hybrid TGF-beta named
; TGF-beta1-3"
; SEQUENCE DESCRIPTION: SEQ ID NO: 7:
US-09-813-271B-7 

Query Match 62.6%; Score 227.2; DB 10; Length 336; 

Best Local Similarity 79.8%; Pred. No. 1.4e-58; 

Matches 268; Conservative 0; Mismatches 68; Indels 0; Gaps 0; 
Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

llllllll llllllll IMIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60 

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

llllllllllllll llllllll llllllllllllll II M IIIIIIIIIIIIIM 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204

Mill iiiiii II II II M III II I II iiiiii mill I 

Db 121 GCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACG 180

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264

II INI IIIIMIII I IIMI I II II II INI lllllllllll 

Db 181 GTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCC 240

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

MM llllllll III MM! lllllll II II II Mill lllllllll 

Db 241 CAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAG 300 

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGC 360

II MIIIIIM I III M M II II Ml 

Db 301 CTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGC 336



RESULT 10 

US-09-906-158-10 

; Sequence 10, Application US/09906158
; Publication No. US20030078217A1
; GENERAL INFORMATION:
; APPLICANT: Brett P. Monia
; APPLICANT: Susan M. Freier
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 EXPRESSION
; FILE REFERENCE: RTS-0257
; CURRENT APPLICATION NUMBER: US/09/906,158
; CURRENT FILING DATE: 2001-07-14
; NUMBER OF SEQ ID NOS: 168
; SEQ ID NO 10
; LENGTH: 2879
; TYPE: DNA
; ORGANISM: Mus musculus
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (611)...(1843)
US-09-906-158-10 

Query Match 58.2%; Score 211.2; DB 11; Length 2879; 

Best Local Similarity 75.9%; Pred. No. 1.4e-53;

Matches 261; Conservative 0; Mismatches 83; Indels 0; Gaps 0; 
Qy 20 ACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGC 79 

I I lllllllll lllll MIIMIII M I nil IIIIIMIII M II I 

Db 1500 AGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTACGCC 1559 

Qy 80 AGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCT 139 

II II llllllllllll MM II llllllll III I M lllll lllll I 

Db 1560 CCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAGGGTT 1619 

Qy 140 ACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACA 199 

II IIIIIM llllll II II II M III II I Ml IMIII I I 

Db 1620 ACTATGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACCCATA 1679 

Qy 200 GCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCG 259

III III II I II IMIM I lllll I lllll M MM lllllll 

Db 1680 GCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGCTGCG 1739 

Qy 260 TGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGG 319 

I M MM IIMIIII III Mill lllllll IMIM I I llllllllll 

Db 1740 TCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAGGTGG 1799 

Qy 320 AGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363

IIIIIMIIIIIIIIII I III II II IIIIIIIMIII 
Db 1800 AGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGA 1843 



RESULT 11 
US-09-957-458B-9
; Sequence 9, Application US/09957458B
; Publication No. US20030166271A1
; GENERAL INFORMATION:
; APPLICANT: Chen, Una
; TITLE OF INVENTION: Method for growing stem cells
; FILE REFERENCE: P66567US0
; CURRENT APPLICATION NUMBER: US/09/957,458B
; CURRENT FILING DATE: 2001-09-21
; PRIOR APPLICATION NUMBER: PCT/EP00/08247
; PRIOR FILING DATE: 2000-08-24
; PRIOR APPLICATION NUMBER: EP 99116533
; PRIOR FILING DATE: 1999-08-24
; NUMBER OF SEQ ID NOS: 10
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 9
; LENGTH: 4382
; TYPE: DNA
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: Vector for transforming supporting cell with a foreign to express
; OTHER INFORMATION: a gene product of interest
US-09-957-458B-9 

Query Match 58.2%; Score 211.2; DB 12; Length 4382; 

Best Local Similarity 75.9%; Pred. No. 1.5e-53; 

Matches 261; Conservative 0; Mismatches 83; Indels 0; Gaps 0; 
Qy 20 ACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGC 79

I I MIIMMI Mill lllllllll M I nil llllllllll II II I 

Db 1353 AGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTACGCC 1412 

Qy 80 AGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCT 139

II II IIIIIIIIIIM INI II IIIIIIII III I II Mill Mill I 

Db 1413 CCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAGGGTT 1472

Qy 140 ACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACA 199

II IMIIII IIIIM II II II II III II I III IMIII I I 

Db 1473 ACTATGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACCCATA 1532 

Qy 200 GCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCG 259

Ml III M I II MUM I Mill I Mill II MM IIIIIII 

Db 1533 GCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGCTGCG 1592 

Qy 260 TGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGG 319 

I II MM IIIIIIII III Mill IIIIIII IIIIII I I llllllllll 

Db 1593 TCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAGGTGG 1652 

Qy 320 AGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363

MIIIMIIIIIIIIII I III II II MIMIIMIII 

Db 1653 AGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGA 1696 



RESULT 12 

US-09-906-158-3 

; Sequence 3, Application US/09906158
; Publication No. US20030078217A1
; GENERAL INFORMATION:
; APPLICANT: Brett P. Monia
; APPLICANT: Susan M. Freier
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 EXPRESSION
; FILE REFERENCE: RTS-0257
; CURRENT APPLICATION NUMBER: US/09/906,158
; CURRENT FILING DATE: 2001-07-14
; NUMBER OF SEQ ID NOS: 168
; SEQ ID NO 3
; LENGTH: 2574
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (254)...(1492)
US-09-906-158-3 

Query Match 58.0%; Score 210.4; DB 11; Length 2574; 

Best Local Similarity 76.2%; Pred. No. 2.4e-53; 

Matches 259; Conservative 0; Mismatches 81; Indels 0; Gaps 0; 

Qy 24 GGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCT 83 

III MM Mill lllllllll II I nil llllllllll Mill I II 

Db 1153 GGCTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCT 1212

Qy 84 CTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA 143 

MMMMMMIM MM IIMIMIMIIIII I MIIIMI MMIIMI I 

Db 1213 CTACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTA 1272 

Qy 144 TGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAA 203 

MMM IIIMI II M M M III II I II llllll llllll 

Db 1273 TGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCAC 1332 

Qy 204 GGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 263

Ml MM MIMMM I Mill I M M M MM MMIMIIII 

Db 1333 GGTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCC 1392

Qy 264 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 323 

MM MIIIMI Ml Mill IIIIIII MM II Mill MIIIMI 

Db 1393 CCAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCA 1452 

Qy 324 GCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

Ml MMIIMI I III II II II II MMM 

Db 1453 GCTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 1492 



RESULT 13 
US-10-028-158-20 

; Sequence 20, Application US/10028158
; Publication No. US20020110833A1
; GENERAL INFORMATION:
; APPLICANT: Caniggia, Isabella
; APPLICANT: Post, Martin
; APPLICANT: Lye, Stephen
; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF
; TITLE OF INVENTION: TROPHOBLAST
; FILE REFERENCE: 11757.38USWO
; CURRENT APPLICATION NUMBER: US/10/028,158
; CURRENT FILING DATE: 2001-12-20
; PRIOR APPLICATION NUMBER: US/09/380,662
; PRIOR FILING DATE: 1999-12-21
; PRIOR APPLICATION NUMBER: PCT/CA98/00180
; PRIOR FILING DATE: 1998-03-05
; PRIOR APPLICATION NUMBER: US 60/039,919
; PRIOR FILING DATE: 1997-03-07
; NUMBER OF SEQ ID NOS: 24
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 20
; LENGTH: 2574
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (254)..(1492)
US-10-028-158-20 

Query Match 58.0%; Score 210.4; DB 13; Length 2574; 

Best Local Similarity 76.2%; Pred. No. 2.4e-53;

Matches 259; Conservative 0; Mismatches 81; Indels 0; Gaps 0; 
Qy 24 GGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCT 83 

III Mil Mill IIIIMMI M I MM MIIMMII IMM I M 

Db 1153 GGCTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCT 1212

Qy 84 CTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA 143

MMMMIIMIM MM MMMMMMIM I MMIMl MIMMII 1 

Db 1213 CTACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTA 1272 

Qy 144 TGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAA 203

IIMII llllll II II M II III II I M llllll llllll 

Db 1273 TGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCAC 1332 

Qy 204 GGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCC 263 

III MM MIMMII I Mill I II II M MM IMIIIIIIII 

Db 1333 GGTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCC 1392 

Qy 264 GCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA 323 

MM IIIMMI Ml MM! MMIM MM II IMM IIMMM 

Db 1393 CCAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCA 1452

Qy 324 GCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363

III IMIIIIII I III II II II II IIMII 

Db 1453 GCTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 1492



RESULT 14 
US-09-813-271B-5 

; Sequence 5, Application US/09813271B
; Patent No. US20020115834A1
; GENERAL INFORMATION:
; APPLICANT:
; (A) Nico Cerletti
; TITLE OF INVENTION: New process for the production of
; biologically active protein
; NUMBER OF SEQUENCES: 13
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Novartis Patent Department
; STREET: 564 Morris Avenue
; CITY: Summit
; STATE: New Jersey
; COUNTRY: USA
; ZIP: 07901
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/813,271B
; FILING DATE: 20-Mar-2001
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: PCT/EP95/02719
; FILING DATE: 12-Jul-95
; APPLICATION NUMBER: EPO 94810439.3
; FILING DATE: 25-Jul-94
; ATTORNEY/AGENT INFORMATION:
; NAME: Pfeiffer, Hesna J.
; REGISTRATION NUMBER: 22640
; REFERENCE/DOCKET NUMBER: 4-20039C/C1C1/USN
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (908) 522-6940
; TELEFAX: (908) 522-6955
; INFORMATION FOR SEQ ID NO: 5:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 339 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: double
; TOPOLOGY: linear
; MOLECULE TYPE: cDNA to mRNA
; IMMEDIATE SOURCE:
; CLONE: E. coli LC137/pPLMu.hTGF-beta3 (DSM 5658)
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 1..336
; OTHER INFORMATION: /product= "human TGF-beta3"
; SEQUENCE DESCRIPTION: SEQ ID NO: 5:
US-09-813-271B-5 

Query Match 57.7%; Score 209.4; DB 10; Length 339; 

Best Local Similarity 76.1%; Pred. No. 3.2e-53; 

Matches 258; Conservative 0; Mismatches 81; Indels 0; Gaps 0;

Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
Db 1
Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
Db 61
Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
Db 121
Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
Db 181




Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

MM MMMM Ml MMI MMMI MM II Mill IIIIIMII 

Db 241 CAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAG 300

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363

II lllllllll I III M II II M MUM 

Db 301 CTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 339 



RESULT 15 
US-09-813-271B-11 

; Sequence 11, Application US/09813271B
; Patent No. US20020115834A1
; GENERAL INFORMATION:
; APPLICANT:
; (A) Nico Cerletti
; TITLE OF INVENTION: New process for the production of
; biologically active protein
; NUMBER OF SEQUENCES: 13
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Novartis Patent Department
; STREET: 564 Morris Avenue
; CITY: Summit
; STATE: New Jersey
; COUNTRY: USA
; ZIP: 07901
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/813,271B
; FILING DATE: 20-Mar-2001
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: PCT/EP95/02719
; FILING DATE: 12-Jul-95
; APPLICATION NUMBER: EPO 94810439.3
; FILING DATE: 25-Jul-94
; ATTORNEY/AGENT INFORMATION:
; NAME: Pfeiffer, Hesna J.
; REGISTRATION NUMBER: 22640
; REFERENCE/DOCKET NUMBER: 4-20039C/C1C1/USN
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (908) 522-6940
; TELEFAX: (908) 522-6955
; INFORMATION FOR SEQ ID NO: 11:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 336 base pairs
; TYPE: nucleic acid
; STRANDEDNESS: double
; TOPOLOGY: linear
; MOLECULE TYPE: other nucleic acid
; DESCRIPTION: /desc = "recombinant hybrid DNA
; IMMEDIATE SOURCE:
; CLONE: E. coli LC137/pPLMu.TGF-beta3(44/45)beta2
; FEATURE:
; NAME/KEY: mat_peptide
; LOCATION: 1..132
; OTHER INFORMATION: /product= "N-terminal 44 amino
; acids of human TGF-beta3"
; FEATURE:
; NAME/KEY: mat_peptide
; LOCATION: 133..336
; OTHER INFORMATION: /product = "C-terminal 68 amino
; acids of human TGF-beta2"
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 1..336
; OTHER INFORMATION: /product= "hybrid TGF-beta3-2"
; SEQUENCE DESCRIPTION: SEQ ID NO: 11:
US-09-813-271B-11 

Query Match 50.2%; Score 182.4; DB 10; Length 336; 

Best Local Similarity 71.4%; Pred. No. 4.1e-45; 

Matches 240; Conservative 0; Mismatches 96; Indels 0; Gaps 0; 
Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

II MM IIMI IMIIMII M I MM MIMIIIII Mill I Ml 

Db 1 GCTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTC 60 

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

IIMIMIIIIIII MM MMMIIMIMII I IIMIMI MIMMII M 

Db 61 TACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTAT 120

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204

Mill MM! II I II II II I Mill llllllllll llllll I 

Db 121 GCCAACTTCTGTGCTGGAGCATGCCCGTATTTATGGAGTTCAGACACTCAGCACAGCAGG 180

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264 

llllll I II II II II I II II II I II MIMMII I 

Db 181 GTCCTGAGCTTATATAATACCATAAATCCAGAAGCATCTGCTTCTCCTTGCTGCGTGTCC 240

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

III III II II MM I llllll I III I MUM I M III 

Db 241 CAAGATTTAGAACCTCTAACCATTCTCTACTACATTGGCAAAACACCCAAGATTGAACAG 300 

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGC 360 

II II II Mill II M Mill llllll 

Db 301 CTTTCTAATATGATTGTAAAGTCTTGCAAATGCAGC 336



Search completed: October 28, 2003, 09:04:08 
Job time : 109.037 secs
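
The listing names no per-base scores, but every gap-free hit above fits one simple model: +1 for each match and -0.6 for each mismatch. The following check treats those two values as an inference from the reported numbers (they are not stated anywhere in the report) and verifies them against seven of the hits.

```python
# Assumed scoring values, inferred from the reported figures only.
MATCH_SCORE = 1.0
MISMATCH_SCORE = -0.6

# hit ID -> (matches, mismatches, reported score); all of these hits have Indels 0.
reported_hits = {
    "US-10-087-268-4": (320, 19, 308.6),
    "US-09-948-002-27": (307, 32, 287.8),
    "US-09-813-271B-7": (268, 68, 227.2),
    "US-09-906-158-10": (261, 83, 211.2),
    "US-09-906-158-3": (259, 81, 210.4),
    "US-09-813-271B-5": (258, 81, 209.4),
    "US-09-813-271B-11": (240, 96, 182.4),
}


def gap_free_score(matches: int, mismatches: int) -> float:
    """Score of an ungapped alignment under the assumed match/mismatch values."""
    return matches * MATCH_SCORE + mismatches * MISMATCH_SCORE


for hit_id, (matches, mismatches, score) in reported_hits.items():
    predicted = gap_free_score(matches, mismatches)
    status = "ok" if abs(predicted - score) < 1e-6 else "differs"
    print(f"{hit_id}: predicted {predicted:.1f}, reported {score:.1f} ({status})")
```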

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         October 27, 2003, 10:47:27 ; Search time 900.804 Seconds
                                           (without alignments)
                                           9794.056 Million cell updates/sec

Title:          US-10-017-372E-10
Perfect score:  363
Sequence:       1 gactacaaggatgacgacga ... gttcctgcaagtgcagctga 363

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0
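
"sw model" refers to Smith-Waterman local alignment, and the header gives the gap parameters. A minimal sketch of the score computation follows, using the Gotoh affine-gap recurrences. Several things here are assumptions rather than facts about the search program: the IDENTITY_NUC values (+1 match, -0.6 mismatch, chosen because they reproduce the scores reported in this listing), the convention that the first gap position costs Gapop and each further position costs Gapext, and the function name.

```python
def smith_waterman_score(query, target, match=1.0, mismatch=-0.6,
                         gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (score only, no traceback)."""
    query, target = query.upper(), target.upper()
    neg_inf = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)      # best score ending at each cell of the previous row
    f_prev = [neg_inf] * (n + 1)  # best score ending in a vertical gap, previous row
    best = 0.0
    for q_base in query:
        h_cur = [0.0] * (n + 1)
        f_cur = [neg_inf] * (n + 1)
        e = neg_inf               # best score ending in a horizontal gap, current row
        for j in range(1, n + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_ext)
            step = match if q_base == target[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + step, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


# Last 39 bases of the query against a target with one substitution.
print(smith_waterman_score("CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA",
                           "CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA"))
```

With a gap-open cost this high relative to the mismatch cost, short alignments between close homologs will normally come out gap-free, which is consistent with the "Indels 0; Gaps 0" lines throughout the listing.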



Searched:       22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters:      45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
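
The throughput figure in this header follows from the other numbers if a "cell update" is taken to be one dynamic-programming cell (one query residue against one database residue) and both strands of every database sequence are searched. The second point is suggested by the hit total being exactly twice the sequence count, but both are assumptions; the variable names are illustrative.

```python
query_length = 363               # "Perfect score" under an identity table
db_sequences = 22_781_392
db_residues = 12_152_238_056
search_seconds = 900.804
strands_searched = 2             # assumption: forward and complement

cells = query_length * db_residues * strands_searched
million_updates_per_sec = cells / search_seconds / 1e6

print(f"{million_updates_per_sec:.3f} Million cell updates/sec")  # 9794.056
print(db_sequences * strands_searched)                            # 45562784
```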



Database :      EST:*
                1:  em_estba:*
                2:  em_esthum:*
                3:  em_estin:*
                4:  em_estmu:*
                5:  em_estov:*
                6:  em_estpl:*
                7:  em_estro:*
                8:  em_htc:*
                9:  gb_est1:*
                10:  gb_est2:*
                11:  gb_htc:*
                12:  gb_est3:*
                13:  gb_est4:*
                14:  gb_est5:*
                15:  em_estfun:*
                16:  em_estom:*
                17:  em_gss_hum:*
                18:  em_gss_inv:*
                19:  em_gss_pln:*
                20:  em_gss_vrt:*
                21:  em_gss_fun:*
                22:  em_gss_mam:*
                23:  em_gss_mus:*
                24:  em_gss_pro:*
                25:  em_gss_rod:*
                26:  em_gss_phg:*
                27:  em_gss_vrl:*
                28:  gb_gss1:*
                29:  gb_gss2:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

Result No. Score Match Length DB ID Description

ALIGNMENTS



RESULT 1 
AI951831/C 

LOCUS       AI951831                 598 bp    mRNA    linear   EST 06-SEP-1999
DEFINITION  wx38b08.x1 NCI_CGAP_Pit1 Homo sapiens cDNA clone IMAGE:2545911 3'
            similar to gb:X02812_cds1 TRANSFORMING GROWTH FACTOR BETA 1
            PRECURSOR (HUMAN); contains PTR7.t3 MER22 repetitive element mRNA
            sequence.
ACCESSION   AI951831
VERSION     AI951831.1  GI:5744141
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 598)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Chris Moskaluk, M.D., Ph.D., Michael R.
            Emmert-Buck, M.D., cDNA Library Preparation: Life Technologies,
            Inc. cDNA Library Arrayed by: Christa Prange, The I.M.A.G.E.
            Consortium DNA Sequencing by: Washington University Genome
            Sequencing Center
            Clone distribution: NCI-CGAP clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            www-bio.llnl.gov/bbrp/image/image.html
            Seq primer: -40UP from Gibco
            High quality sequence stop: 425.
FEATURES             Location/Qualifiers
     source          1..598
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:2545911"
                     /tissue_type="four pooled pituitary adenomas"
                     /lab_host="DH10B"
                     /clone_lib="NCI_CGAP_Pit1"
                     /note="Organ: brain; Vector: pCMV-SPORT6; Site_1: SalI;
                     Site_2: NotI; Cloned unidirectionally. Primer: Oligo dT.
                     Library constructed by Life Technologies."
BASE COUNT       95 a    173 c    222 g    107 t      1 others
ORIGIN



Query Match 85.5%; Score 310.2; DB 9; Length 598;
Best Local Similarity 93.4%; Pred. No. 1.8e-66;
Matches 324; Conservative 0; Mismatches 23; Indels 0; Gaps 0;



Qy     17 ACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGC 76
          ||  | | |||||||| |||||||| ||||||||||||||||||||||||||||||||||
Db    484 ACCGCGAAGCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGC 425

Qy     77 GGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGG 136
          ||||||| |||||||||||||| |||||||| |||||||||||||| || || |||||||
Db    424 GGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGG 365

Qy    137 GCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGT 196
          ||||||||||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||
Db    364 GCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGT 305

Qy    197 ACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCT 256
          |||||||||||||||| |||||||||||||| ||||||||||| ||||||||||||||||
Db    304 ACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCT 245

Qy    257 GCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGG 316
          |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db    244 GCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGG 185

Qy    317 TGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
          |||||||||||||||||||||||||||| ||||||||||||||||||
Db    184 TGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 138
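
The Db coordinates in this alignment run downward (484 to 138), which is what one would expect if the "/C" suffix on the hit ID marks a match to the complementary strand. That reading is an assumption; the report does not define the suffix. Under it, the Db text printed above is the reverse complement of the stored EST bases, and the stored orientation can be recovered as sketched here (the helper name is illustrative).

```python
# Translation table for the DNA complement, upper and lower case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Db text of the last alignment block, printed for database positions 184 down to 138.
shown = "TGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA"
db_from, db_to = 184, 138

# The block length must agree with the coordinate span.
assert len(shown) == db_from - db_to + 1

# Bases 138..184 in the orientation in which the EST record stores them.
stored = reverse_complement(shown)
print(stored)
```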



RESULT 2 

BI868266 

LOCUS       BI868266                 396 bp    mRNA    linear   EST 11-OCT-2001
DEFINITION  603392221F1 NIH_MGC_90 Homo sapiens cDNA clone IMAGE:5402212 5',
            mRNA sequence.
ACCESSION   BI868266
VERSION     BI868266.1  GI:16041939
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 396)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM12025 row: j column: 05
            High quality sequence stop: 383.
FEATURES             Location/Qualifiers
     source          1..396
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5402212"
                     /tissue_type="adenocarcinoma, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_90"
                     /note="Organ: liver; Vector: pCMV-SPORT6; Site_1: NotI;
                     Site_2: SalI; Cloned unidirectionally; oligo-dT primed.
                     Average insert size 1.7 kb. Library enriched for
                     full-length clones and constructed by Life Technologies.
                     Note: this is a NIH_MGC Library."
BASE COUNT       75 a    135 c    121 g     65 t
ORIGIN



Query Match 85.0%; Score 308.6; DB 12; Length 396;
Best Local Similarity 94.4%; Pred. No. 3.8e-66;
Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0;



Qy 25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

llllllll llllllll lllllllllllllllllllllllllllllllllllllllll 
Db 41 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 100 

Qy 85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

MMIIMMMM IIMIIM IIMIIMMIMI 11 M MMIMMMIIII 

Db 101 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 16 0 

Qy 14 5 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204 

Mill llllllll llllllll Mllllll llllllll INN llllllllllll 

Db 161 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 220 

Qy 2 05 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264 

llllllll Mllllllllllll lllllllllll llllllllllllllllllllllll 

Db 221 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 280 

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

IMIIMIMMII MIIIIIIIIIMIIIIIMIIIMIIMIMMIIIIMIIIII 

Db 281 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 340 

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

Illlllllllllllllllll IMIIIIIIIIIIIIIII 

Db 341 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 379 
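
The summary figures for this hit can be checked directly from the aligned rows. The sketch below is illustrative only: it takes the Qy and Db rows of the BI868266 alignment above (Qy 25-363 against Db 41-379), rebuilds the match line, and recounts matches and mismatches. The function name `match_line` is hypothetical, and the formula used for "Best Local Similarity" (matches divided by aligned columns) is an assumption, since the report does not define it.

```python
# Rows copied from the ungapped alignment of the query (Qy 25-363)
# against BI868266 (Db 41-379).
QY_ROWS = [
    "GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC",
    "TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT",
    "GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG",
    "GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG",
    "CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG",
    "CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA",
]
DB_ROWS = [
    "GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG",
    "TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT",
    "GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG",
    "GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG",
    "CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG",
    "CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA",
]


def match_line(qy: str, db: str) -> str:
    """Return '|' under identical bases and ' ' under mismatches."""
    if len(qy) != len(db):
        raise ValueError("ungapped rows must have equal length")
    return "".join("|" if a == b else " " for a, b in zip(qy, db))


lines = [match_line(q, d) for q, d in zip(QY_ROWS, DB_ROWS)]
columns = sum(len(line) for line in lines)
matches = sum(line.count("|") for line in lines)
mismatches = columns - matches
# Assumed definition: identical columns over all aligned columns.
similarity = 100.0 * matches / columns

for q, m, d in zip(QY_ROWS, lines, DB_ROWS):
    print(q)
    print(m)
    print(d)
    print()
print(f"columns={columns} matches={matches} mismatches={mismatches}")
print(f"similarity={similarity:.1f}%")
```

Because the same 339-base region of the query aligns to the same database sequence in each of the hits listed here, the recount applies to every result in this group; only the Db coordinates and strand differ.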



RESULT 3
BI820759
LOCUS       BI820759     400 bp    mRNA    linear   EST 04-OCT-2001
DEFINITION  603034355F1 NIH_MGC_115 Homo sapiens cDNA clone IMAGE:5175732 5',
            mRNA sequence.
ACCESSION   BI820759
VERSION     BI820759.1  GI:15932309
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 400)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM11437 row: m column: 13
            High quality sequence stop: 364.
FEATURES             Location/Qualifiers
     source          1..400
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5175732"
                     /lab_host="DH10B"
                     /clone_lib="NIH_MGC_115"
                     /note="Organ: pooled brain, lung, testis; Vector:
                     pCMV-SPORT6; Site_1: NotI; Site_2: EcoRV (destroyed); RNA
                     source anonymous pool of 6 male brains, age range 23-27; 1
                     male lung, age 27; and 1 male testis, age 69. Library is
                     oligo-dT primed and directionally cloned (EcoRV site is
                     destroyed upon cloning). Average insert size 1.8 kb,
                     insert size range 1-3 kb. Library is normalized and
                     enriched for full-length clones and was constructed by C.
                     Gruber (Invitrogen). Research Genetics tracking code
                     021. Note: this is a NIH_MGC Library."
BASE COUNT       73 a    131 c    131 g     65 t
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 12;  Length 400;
  Best Local Similarity   94.4%;  Pred. No. 3.8e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db       20 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 79

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db       80 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 139

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      140 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 199

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      200 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 259

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      260 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 319

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      320 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 358



RESULT 4
BM740537
LOCUS       BM740537     461 bp    mRNA    linear   EST 01-MAR-2002
DEFINITION  K-EST0011722 S1SNU5 Homo sapiens cDNA clone S1SNU5-5-B09 5', mRNA
            sequence.
ACCESSION   BM740537
VERSION     BM740537.1  GI:19061866
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 461)
  AUTHORS   Kim,N.S., Hahn,Y., Oh,J.H., Lee,J.Y., Ahn,H.Y., Chu,M.Y., Kim,M.R.,
            Oh,K.J., Cheong,J.E., Sohn,H.Y., Kim,J.M., Park,H.S., Kim,S. and
            Kim,Y.S.
  TITLE     21C Frontier Korean EST Project 2001
  JOURNAL   Unpublished
COMMENT     Contact: Kim YS
            Genome Research Center
            Korea Research Institute of Bioscience & Biotechnology
            52 Eoeun-dong Yuseong-gu, Daejeon 305-333, South Korea
            Tel: +82-42-860-4470
            Fax: +82-42-860-4409
            Email: yongsung@mail.kribb.re.kr
            Plate: 5 row: B column: 09
            High quality sequence stop: 461.
FEATURES             Location/Qualifiers
     source          1..461
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="S1SNU5-5-B09"
                     /sex="F"
                     /tissue_type="Ascites"
                     /cell_type="Lymphoblast-like"
                     /cell_line="SNU-5"
                     /lab_host="Top10F'"
                     /clone_lib="S1SNU5"
                     /note="Organ: Stomach; Vector: pCNS; Site_1: EcoRI;
                     Site_2: NotI; The poly(A)+ RNA was dephosphorylated with
                     bacterial alkaline phosphatase (BAP) and then decapped
                     with tobacco acid pyrophosphatase (TAP). The decapped
                     intact mRNA was ligated with DNA-RNA linker including EcoR
                     I site by treatment of T4 RNA ligase and the first strand
                     cDNA was synthesized from oligo dT-selected mRNA by
                     priming with dT-tailed vector. The dT-tailed vector was
                     adjusted to have about 60nt. The cDNA vector was
                     circularized with E. coli DNA ligase after digestion of
                     EcoRI which site is also included in vector. An RNA strand
                     converted to a DNA strand by Okayama-Berg method. The
                     obtained cDNA vectors were used for transformation of
                     competent cells E. coli Top10F' by electroporation method.
                     The cDNA libraries constructed by this method are
                     full-length enriched cDNA library."
BASE COUNT       86 a    161 c    133 g     81 t
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 12;  Length 461;
  Best Local Similarity   94.4%;  Pred. No. 4e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db      119 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 178

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db      179 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 238

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      239 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 298

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      299 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 358

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      359 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 418

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      419 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 457



RESULT 5
AW958056
LOCUS       AW958056     505 bp    mRNA    linear   EST 01-JUN-2000
DEFINITION  EST370126 MAGE resequences, MAGE Homo sapiens cDNA, mRNA sequence.
ACCESSION   AW958056
VERSION     AW958056.1  GI:8147739
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 505)
  AUTHORS   Hegde,P., Qi,R., Abernathy,K., Dharap,S., Gaspard,R., Gay,C.,
            Holt,I.E., Saeed,A.I., Sharov,V., Lee,N.H., Yeatman,T.J. and
            Quackenbush,J.
  TITLE     Assessment of gene expression patterns in a model of colon tumor
            metastasis using a 19,200 element cDNA microarray
  JOURNAL   Unpublished
COMMENT     Contact: John Quackenbush
            The Institute for Genomic Research
            9712 Medical Center Dr., Rockville, MD 20850, USA
            Tel: 301 838 3528
            Fax: 301 838 0208
            Email: johnq@tigr.org
            Plate: 115
            Seq primer: Reverse.
FEATURES             Location/Qualifiers
     source          1..505
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone_lib="MAGE resequences, MAGE"
                     /note="Vector: pBluescriptSKm"
BASE COUNT      115 a    137 c    106 g    144 t      3 others
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 10;  Length 505;
  Best Local Similarity   94.4%;  Pred. No. 4.1e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db        9 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 68

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db       69 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 128

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      129 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 188

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      189 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 248

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      249 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 308

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      309 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 347



RESULT 6
BQ222205
LOCUS       BQ222205     536 bp    mRNA    linear   EST 02-MAY-2002
DEFINITION  AGENCOURT_7258595 NIH_MGC_71 Homo sapiens cDNA clone IMAGE:5786680
            5', mRNA sequence.
ACCESSION   BQ222205
VERSION     BQ222205.1  GI:20403594
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 536)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM12875 row: m column: 17
            High quality sequence stop: 535.
FEATURES             Location/Qualifiers
     source          1..536
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5786680"
                     /tissue_type="leiomyosarcoma"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_71"
                     /note="Organ: uterus; Vector: pCMV-SPORT6; Site_1: NotI;
                     Site_2: SalI; Cloned unidirectionally. Primer: Oligo dT.
                     Average insert size 2.1 kb."
BASE COUNT      113 a    193 c    150 g     77 t      3 others
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 13;  Length 536;
  Best Local Similarity   94.4%;  Pred. No. 4.2e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db       36 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 95

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db       96 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 155

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      156 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 215

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      216 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 275

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      276 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 335

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      336 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 374



RESULT 7
BM670402/C
LOCUS       BM670402     538 bp    mRNA    linear   EST 27-FEB-2002
DEFINITION  UI-E-DW1-ahc-l-03-0-UI.s1 UI-E-DW1 Homo sapiens cDNA clone
            UI-E-DW1-ahc-l-03-0-UI 3', mRNA sequence.
ACCESSION   BM670402
VERSION     BM670402.1  GI:18980299
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 538)
  AUTHORS   Bonaldo,M.F., Lennon,G. and Soares,M.B.
  TITLE     Normalization and subtraction: two approaches to facilitate gene
            discovery
  JOURNAL   Genome Res. 6 (9), 791-806 (1996)
  MEDLINE   97044477
   PUBMED   8889548
COMMENT     Contact: Soares, MB
            Coordinated Laboratory for Computational Genomics
            University of Iowa
            375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA
            Tel: 319 335 8250
            Fax: 319 335 9565
            Email: bento-soares@uiowa.edu
            Tissue Procurement: Dr. Gregg Hageman
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Researchers may obtain clones from Research
            Genetics (www.resgen.com).
            The following repetitive elements were found in this cDNA
            sequence: 97-152, >GC_rich#Low_complexity
            Seq primer: M13 Forward
            POLYA=Yes.
FEATURES             Location/Qualifiers
     source          1..538
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="UI-E-DW1-ahc-l-03-0-UI"
                     /tissue_type="lens"
                     /dev_stage="adult"
                     /lab_host="DH10B (Life Technologies) (T1 phage resistant)"
                     /clone_lib="UI-E-DW1"
                     /note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a
                     modified polylinker; Site_1: EcoR I; Site_2: Not I;
                     UI-E-DW1 is a normalized cDNA library containing the
                     following tissue(s): lens. The library was constructed
                     according to Bonaldo, Lennon and Soares, Genome Research,
                     6:791-806, 1996. First strand cDNA synthesis was primed
                     with an oligo-dT primer containing a Not I site. Double
                     stranded cDNA was ligated to an EcoR I adaptor, digested
                     with Not I, and cloned directionally into pT7T3-Pac
                     vector. The oligonucleotide used to prime the synthesis of
                     first-strand cDNA contains a library tag sequence that is
                     located between the Not I site and the (dT)18 tail. The
                     sequence tag for this library is CGATTAGCGA. This library
                     was created for the program, Gene Discovery in the Visual
                     System, supported by National Eye Institute (NEI).
                     TAG_LIB=UI-E-DW1
                     TAG_TISSUE=human lens
                     TAG_SEQ=CGATTAGCGA"
BASE COUNT       78 a    153 c    195 g    112 t
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 12;  Length 538;
  Best Local Similarity   94.4%;  Pred. No. 4.2e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db      494 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 435

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db      434 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 375

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      374 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 315

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      314 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 255

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      254 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 195

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      194 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 156



RESULT 8
BI834732
LOCUS       BI834732     540 bp    mRNA    linear   EST 04-OCT-2001
DEFINITION  603090326F1 NIH_MGC_120 Homo sapiens cDNA clone IMAGE:5229246 5',
            mRNA sequence.
ACCESSION   BI834732
VERSION     BI834732.1  GI:15946269
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 540)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM11577 row: c column: 07
            High quality sequence stop: 535.
FEATURES             Location/Qualifiers
     source          1..540
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5229246"
                     /lab_host="DH10B"
                     /clone_lib="NIH_MGC_120"
                     /note="Organ: pooled pancreas and spleen; Vector:
                     pCMV-SPORT6; Site_1: NotI; Site_2: EcoRV (destroyed); RNA
                     source anonymous pool of spleen and pancreas from 28 yo
                     male. Library is oligo-dT primed and directionally cloned
                     (EcoRV site is destroyed upon cloning). Average insert
                     size 1.5 kb, insert size range 1-2.5 kb. Library is
                     normalized and enriched for full-length clones and was
                     constructed by C. Gruber (Invitrogen). Research Genetics
                     tracking code 025. Note: this is a NIH_MGC Library."
BASE COUNT      104 a    188 c    154 g     94 t
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 12;  Length 540;
  Best Local Similarity   94.4%;  Pred. No. 4.2e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db      191 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 250

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db      251 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 310

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      311 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 370

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      371 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 430

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      431 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 490

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      491 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 529



RESULT 9
BQ003136/C
LOCUS       BQ003136     593 bp    mRNA    linear   EST 26-MAR-2002
DEFINITION  UI-H-EI1-ayx-d-19-0-UI.s1 NCI_CGAP_EI1 Homo sapiens cDNA clone
            IMAGE:5844834 3', mRNA sequence.
ACCESSION   BQ003136
VERSION     BQ003136.1  GI:19728036
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 593)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Jose Mercuende
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            The following repetitive elements were found in this cDNA
            sequence: 97-152, >GC_rich#Low_complexity
            Seq primer: M13 FORWARD
            POLYA=Yes.
FEATURES             Location/Qualifiers
     source          1..593
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5844834"
                     /tissue_type="Chondrosarcoma"
                     /dev_stage="Adult"
                     /lab_host="DH10B (Life Technologies)"
                     /clone_lib="NCI_CGAP_EI1"
                     /note="Organ: Left Pelvis; Vector: pT7T3-Pac (Pharmacia)
                     with a modified polylinker; Site_1: EcoR I; Site_2: Not I;
                     NCI_CGAP_EI1 is a normalized cDNA library containing the
                     following tissue(s): Chondrosarcoma. The library was
                     constructed according to Bonaldo, Lennon and Soares,
                     Genome Research, 6:791-806, 1996. First strand cDNA
                     synthesis was primed with an oligo-dT primer containing a
                     Not I site. Double stranded cDNA was ligated to an EcoR I
                     adaptor, digested with Not I, and cloned directionally
                     into pT7T3-Pac vector. The oligonucleotide used to prime
                     the synthesis of first-strand cDNA contains a library tag
                     sequence that is located between the Not I site and the
                     (dT)18 tail. The sequence tag for this library is
                     ACACTTGCAC.
                     TAG_LIB=UI-H-EI1
                     TAG_TISSUE=chondrosarcoma
                     TAG_SEQ=ACACTTGCAC"
BASE COUNT       91 a    164 c    217 g    121 t
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 12;  Length 593;
  Best Local Similarity   94.4%;  Pred. No. 4.4e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db      494 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 435

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db      434 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 375

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      374 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 315

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      314 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 255

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      254 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 195

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      194 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 156



RESULT 10
BI905963
LOCUS       BI905963     599 bp    mRNA    linear   EST 16-OCT-2001
DEFINITION  603062849F1 NIH_MGC_118 Homo sapiens cDNA clone IMAGE:5211925 5',
            mRNA sequence.
ACCESSION   BI905963
VERSION     BI905963.1  GI:16168598
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 599)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM11532 row: a column: 14
            High quality sequence stop: 567.
FEATURES             Location/Qualifiers
     source          1..599
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5211925"
                     /tissue_type="leukocyte"
                     /lab_host="DH10B"
                     /clone_lib="NIH_MGC_118"
                     /note="Vector: pCMV-SPORT6; Site_1: NotI; Site_2: EcoRV
                     (destroyed); RNA source leukocytes from anonymous pool of
                     non-activated adult donors. Library is oligo-dT primed
                     and directionally cloned (EcoRV site is destroyed upon
                     cloning). Average insert size 1.7 kb, insert size range
                     1.2-3.3 kb. Library is normalized and enriched for
                     full-length clones and was constructed by C. Gruber
                     (Invitrogen). Research Genetics tracking code 027. Note:
                     this is a NIH_MGC Library."
BASE COUNT      112 a    201 c    181 g    105 t
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 12;  Length 599;
  Best Local Similarity   94.4%;  Pred. No. 4.4e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db      233 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 292

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db      293 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 352

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      353 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 412

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      413 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 472

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      473 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 532

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      533 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 571



RESULT 11
BF726995/C
LOCUS       BF726995     600 bp    mRNA    linear   EST 05-JAN-2001
DEFINITION  by15c03.y1 Human Lens cDNA (Un-normalized, unamplified): BY Homo
            sapiens cDNA clone by15c03 5', mRNA sequence.
ACCESSION   BF726995
VERSION     BF726995.1  GI:12042906
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 600)
  AUTHORS   Wistow,G.J., Bernstein,S., Behal,A. and Smith,D.
  TITLE     NEIBANK: EST analysis and bioinformatics for ocular genomics
  JOURNAL   Invest. Ophthalmol. Vis. Sci. 41, (2000) In press
COMMENT     Contact: Wistow G
            Section on Molecular Structure and Function
            National Eye Institute
            6/331, NIH, Bethesda, MD 20892-2740, USA
            Tel: 301 402 3452
            Fax: 301 496 0078
            Email: graeme@helix.nih.gov
            Plate: 15 row: c column: 03
            Seq primer: M13RP1 reverse primer (ABI).
FEATURES             Location/Qualifiers
     source          1..600
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="by15c03"
                     /tissue_type="Lens"
                     /dev_stage="Adult"
                     /lab_host="EMDH10B"
                     /clone_lib="Human Lens cDNA (Un-normalized, unamplified):
                     BY"
                     /note="Organ: Eye; Vector: pCMVSPORT6; Two human lenses
                     from different adults (both approximately 40 years old)
                     together yielded 20ug of total RNA and 150ng mRNA for cDNA
                     library synthesis. A directionally cloned cDNA library in
                     the pCMVSPORT6 vector was constructed at Life Technologies,
                     essentially following the protocols of the Superscript
                     Plasmid System full details of which are contained in the
                     manufacturer's instruction manual
                     (http://www.lifetech.com/). First strand synthesis was
                     carried out using a Not I primer-adapter
                     [5'-pGACTAGTTCTAGATCGCGAGCGGCCGCCC(T)15-3']. Not I/blunt
                     end inserts were cloned into the Not I/EcoR V sites in the
                     vector. EST analysis was performed on the unamplified
                     library at the NIH Intramural Sequencing Center (NISC)."
BASE COUNT       91 a    170 c    216 g    122 t      1 others
ORIGIN



  Query Match             85.0%;  Score 308.6;  DB 10;  Length 600;
  Best Local Similarity   94.4%;  Pred. No. 4.4e-66;
  Matches  320;  Conservative    0;  Mismatches   19;  Indels    0;  Gaps    0;

Qy       25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
            |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db      512 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 453

Qy       85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
            |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db      452 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 393

Qy      145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
            ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db      392 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 333

Qy      205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
            |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db      332 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 273

Qy      265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
            |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db      272 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 213

Qy      325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
            |||||||||||||||||||| ||||||||||||||||||
Db      212 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 174



RESULT 12 
BM674987/C 

LOCUS       BM674987     616 bp    mRNA    linear   EST 27-FEB-2002
DEFINITION  UI-E-EJ0-ahn-k-10-0-UI.s1 UI-E-EJ0 Homo sapiens cDNA clone
            UI-E-EJ0-ahn-k-10-0-UI 3', mRNA sequence.
ACCESSION   BM674987
VERSION     BM674987.1  GI:18984885
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Bonaldo,M.F., Lennon,G. and Soares,M.B.
  TITLE     Normalization and subtraction: two approaches to facilitate gene
            discovery
  JOURNAL   Genome Res. 6 (9), 791-806 (1996)
  MEDLINE   97044477
   PUBMED   8889548
COMMENT     Contact: Soares, MB
            Coordinated Laboratory for Computational Genomics
            University of Iowa
            375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA
            Tel: 319 335 8250
            Fax: 319 335 9565
            Email: bento-soares@uiowa.edu
            Tissue Procurement: Dr. Gregg Hageman
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Researchers may obtain clones from Research
            Genetics (www.resgen.com).
            The following repetitive elements were found in this cDNA
            sequence: 97-152, >GC_rich#Low_complexity
            Seq primer: M13 Forward
            POLYA=Yes.
FEATURES             Location/Qualifiers
     source          1..616
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="UI-E-EJ0-ahn-k-10-0-UI"
                     /tissue_type="fetal eyes, lens, eye anterior segment,
                     optic nerve, retina, Retina Foveal and Macular, RPE and
                     Choroid"
                     /dev_stage="fetal and adult"
                     /lab_host="DH10B (Life Technologies) (T1 phage resistant)"
                     /clone_lib="UI-E-EJ0"
                     /note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a
                     modified polylinker; Site_1: EcoR I; Site_2: Not I;



BASE COUNT 
ORIGIN 



UI-E-EJO is a subtracted cDNA library constructed 
according to Bonaldo, Lennon and Scares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tags for this library are: fetal eyes, AGAATCAAGA 
; lens, CGATTAGCGA; eye anterior segment, AATGCCGCAT; 
optic nerve, CCATTAAGTG; retina, CCGCG; Retina Foveal and 
Macular, GTCC; RPE and Choroid, ACCTA. This library was 
created for the program. Gene Discovery in the Visual 
System, supported by National Eye Institute (NEI) . 
TAG_LIB=UI-E-EJO 
TAG_TISSUE=RPE and Choroid 
TAG_SEQ=ACCTA" 
92 a 175 c 225 g 124 t 



Query Match 85.0%; Score 308.6; DB 12; Length 616;
Best Local Similarity 94.4%; Pred. No. 4.4e-66;
Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0;

Qy  25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
       |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db 494 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 435

Qy  85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
       |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db 434 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 375

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
       ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db 374 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 315

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
       |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 314 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 255

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
       |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db 254 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 195

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
       |||||||||||||||||||| ||||||||||||||||||
Db 194 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 156



RESULT 13 

CA426732/C 

LOCUS CA426732 625 bp mRNA linear EST 07-NOV-2002
DEFINITION UI-H-FE1-bei-e-23-0-UI.s1 NCI_CGAP_FE1 Homo sapiens cDNA clone
UI-H-FE1-bei-e-23-0-UI 3', mRNA sequence.
ACCESSION CA426732
VERSION CA426732.1 GI:24789458
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 625)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: James Martin
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
Clone Distribution: Clone distribution information can be obtained
from Dr. M. Bento Soares, bento-soares@uiowa.edu
The following repetitive elements were found in this cDNA
sequence: 97-152, >GC_rich#Low_complexity
Seq primer: M13 FORWARD
POLYA=Yes.
FEATURES Location/Qualifiers
source 1..625
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="UI-H-FE1-bei-e-23-0-UI"
/tissue_type="Cell lines"
/dev_stage="Adult"
/lab_host="DH10B (Life Technologies)"
/clone_lib="NCI_CGAP_FE1"
/note="Organ: Chondrosarcoma; Vector: pT7T3-Pac (Pharmacia)
with a modified polylinker; Site_1: EcoR I; Site_2: Not
I; NCI_CGAP_FE1 is a normalized cDNA library derived from
a pool of mRNA obtained from 3 cell lines from grade II
chondrosarcoma tissues. The library was constructed
according to Bonaldo, Lennon and Soares, Genome Research,
6:791-806, 1996. First strand cDNA synthesis was primed
with an oligo-dT primer containing a Not I site. Double
stranded cDNA was ligated to an EcoR I adaptor, digested
with Not I, and cloned directionally into pT7T3-Pac
vector. The oligonucleotide used to prime the synthesis of
first-strand cDNA contains a library tag sequence that is
located between the Not I site and the (dT)18 tail. The
sequence tag for this library is CGCTACGGAC. The cell
lines were provided by Dr James Martin from the University
of Iowa.
TAG_LIB=UI-H-FE1
TAG_TISSUE=Human grade 2 chondrosarcoma cell line pool
TAG_SEQ=CGCTACGGAC"
BASE COUNT 95 a 175 c 229 g 126 t
ORIGIN



Query Match 85.0%; Score 308.6; DB 14; Length 625;
Best Local Similarity 94.4%; Pred. No. 4.4e-66;
Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0;

Qy  25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
       |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db 494 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 435

Qy  85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
       |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db 434 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 375

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
       ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db 374 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 315

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
       |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 314 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 255

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
       |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db 254 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 195

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
       |||||||||||||||||||| ||||||||||||||||||
Db 194 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 156



RESULT 14 
CA423539/C 

LOCUS CA423539 649 bp mRNA linear EST 07-NOV-2002
DEFINITION UI-H-FE1-bec-p-13-0-UI.s1 NCI_CGAP_FE1 Homo sapiens cDNA clone
UI-H-FE1-bec-p-13-0-UI 3', mRNA sequence.
ACCESSION CA423539
VERSION CA423539.1 GI:24786265
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 649)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: James Martin
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
Clone Distribution: Clone distribution information can be obtained
from Dr. M. Bento Soares, bento-soares@uiowa.edu
The following repetitive elements were found in this cDNA
sequence: 97-152, >GC_rich#Low_complexity
Seq primer: M13 FORWARD
POLYA=Yes.
FEATURES Location/Qualifiers
source 1..649
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="UI-H-FE1-bec-p-13-0-UI"
/tissue_type="Cell lines"
/dev_stage="Adult"
/lab_host="DH10B (Life Technologies)"
/clone_lib="NCI_CGAP_FE1"
/note="Organ: Chondrosarcoma; Vector: pT7T3-Pac (Pharmacia)
with a modified polylinker; Site_1: EcoR I; Site_2: Not
I; NCI_CGAP_FE1 is a normalized cDNA library derived from
a pool of mRNA obtained from 3 cell lines from grade II
chondrosarcoma tissues. The library was constructed
according to Bonaldo, Lennon and Soares, Genome Research,
6:791-806, 1996. First strand cDNA synthesis was primed
with an oligo-dT primer containing a Not I site. Double
stranded cDNA was ligated to an EcoR I adaptor, digested
with Not I, and cloned directionally into pT7T3-Pac
vector. The oligonucleotide used to prime the synthesis of
first-strand cDNA contains a library tag sequence that is
located between the Not I site and the (dT)18 tail. The
sequence tag for this library is CGCTACGGAC. The cell
lines were provided by Dr James Martin from the University
of Iowa.
TAG_LIB=UI-H-FE1
TAG_TISSUE=Human grade 2 chondrosarcoma cell line pool
TAG_SEQ=CGCTACGGAC"
BASE COUNT 99 a 179 c 234 g 137 t
ORIGIN



Query Match 85.0%; Score 308.6; DB 14; Length 649;
Best Local Similarity 94.4%; Pred. No. 4.5e-66;
Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0;

Qy  25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
       |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db 494 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 435

Qy  85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
       |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db 434 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 375

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
       ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db 374 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 315

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
       |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 314 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 255

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
       |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db 254 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 195

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
       |||||||||||||||||||| ||||||||||||||||||
Db 194 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 156



RESULT 15 
BM681750/C 

LOCUS BM681750 654 bp mRNA linear EST 27-FEB-2002
DEFINITION UI-E-EJ0-aik-b-22-0-UI.s1 UI-E-EJ0 Homo sapiens cDNA clone
UI-E-EJ0-aik-b-22-0-UI 3', mRNA sequence.
ACCESSION BM681750
VERSION BM681750.1 GI:18991646
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 654)
AUTHORS Bonaldo,M.F., Lennon,G. and Soares,M.B.
TITLE Normalization and subtraction: two approaches to facilitate gene
discovery
JOURNAL Genome Res. 6 (9), 791-806 (1996)
MEDLINE 97044477
PUBMED 8889548
COMMENT Contact: Soares, MB
Coordinated Laboratory for Computational Genomics
University of Iowa
375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA
Tel: 319 335 8250
Fax: 319 335 9565
Email: bento-soares@uiowa.edu
Tissue Procurement: Dr. Gregg Hageman
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
Clone Distribution: Researchers may obtain clones from Research
Genetics (www.resgen.com).
The following repetitive elements were found in this cDNA
sequence: 97-152, >GC_rich#Low_complexity
Seq primer: M13 Forward
POLYA=Yes.
FEATURES Location/Qualifiers
source 1..654
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="UI-E-EJ0-aik-b-22-0-UI"
/tissue_type="fetal eyes, lens, eye anterior segment,
optic nerve, retina, Retina Foveal and Macular, RPE and
Choroid"
/dev_stage="fetal and adult"
/lab_host="DH10B (Life Technologies) (T1 phage resistant)"
/clone_lib="UI-E-EJ0"
/note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a
modified polylinker; Site_1: EcoR I; Site_2: Not I;
UI-E-EJ0 is a subtracted cDNA library constructed
according to Bonaldo, Lennon and Soares, Genome Research,
6:791-806, 1996. First strand cDNA synthesis was primed
with an oligo-dT primer containing a Not I site. Double
stranded cDNA was ligated to an EcoR I adaptor, digested
with Not I, and cloned directionally into pT7T3-Pac
vector. The oligonucleotide used to prime the synthesis of
first-strand cDNA contains a library tag sequence that is
located between the Not I site and the (dT)18 tail. The
sequence tags for this library are: fetal eyes, AGAATCAAGA;
lens, CGATTAGCGA; eye anterior segment, AATGCCGCAT;
optic nerve, CCATTAAGTG; retina, CCGCG; Retina Foveal and
Macular, GTCC; RPE and Choroid, ACCTA. This library was
created for the program, Gene Discovery in the Visual
System, supported by National Eye Institute (NEI).
TAG_LIB=UI-E-EJ0
TAG_TISSUE=human lens
TAG_SEQ=CGATTAGCGA"
BASE COUNT 100 a 182 c 234 g 138 t
ORIGIN

Query Match 85.0%; Score 308.6; DB 12; Length 654;
Best Local Similarity 94.4%; Pred. No. 4.5e-66;
Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0;

Qy  25 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 84
       |||||||| |||||||| |||||||||||||||||||||||||||||||||||||||||
Db 494 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 435

Qy  85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144
       |||||||||||||| |||||||| |||||||||||||| || || |||||||||||||||
Db 434 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 375

Qy 145 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 204
       ||||| |||||||| |||||||| |||||||| |||||||| ||||| ||||||||||||
Db 374 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 315

Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264
       |||||||| |||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 314 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 255

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324
       |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db 254 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 195

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363
       |||||||||||||||||||| ||||||||||||||||||
Db 194 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 156
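
The summary figures in these listings can be cross-checked against the aligned rows. The sketch below is illustrative only and is not part of the search report: it assumes that "Best Local Similarity" is the number of matching columns divided by the number of aligned columns, which is consistent with the reported 320 matches and 19 mismatches. The function name `alignment_stats` is hypothetical. It is applied here to the last row of the alignment above (Qy 325-363 against Db 194-156).

```python
def alignment_stats(query: str, subject: str) -> dict:
    """Count matches and mismatches for two gap-free aligned rows of equal length."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # One marker per column: '|' for identical bases, ' ' for a mismatch.
    match_line = "".join("|" if q == s else " " for q, s in zip(query, subject))
    matches = match_line.count("|")
    return {
        "matches": matches,
        "mismatches": len(query) - matches,
        "identity_pct": round(100.0 * matches / len(query), 1),
        "match_line": match_line,
    }


# Last alignment row of the listing (Qy 325-363 vs. Db 194-156).
qy = "CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA"
db = "CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA"
stats = alignment_stats(qy, db)
print(stats["matches"], stats["mismatches"])

# Whole-alignment check using the reported totals (assumed definition, see above).
overall_pct = round(100.0 * 320 / (320 + 19), 1)
print(overall_pct)
```

Under that assumption the reported totals give 320 / 339, which rounds to the 94.4% stated in each summary block.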
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